Has anyone tried fine-tuning on framework-specific toolsets?
r/LocalLLaMA
•
Machine Learning
Open Source AI
AI Research
One setback of smaller local models seems to be their reliability in calling tools for the harness they're plugged into. I personally tried out Gemma 4 with Hermes Agent, and Gemma kept ignoring Hermes' tools - for example, it kept trying to call the 'google-search' tool it was trained with instead of the web-search tool it was instructed to use. I have never fine tuned and don't know much about it, but is this something that can be improved through fine tuning? Say, tuning the model specifically on Hermes tool calls.