When using some of the smaller, local models, sometimes they can't handle the more complex structured outputs you might want to use. It should be possible to access the raw response of the model for debugging purposes, or if you just want to interact with it in an unstructured way.