Conversation

@shanghongsim
Contributor

Description

  1. Consume the reasoning field of the ChatCompletions object
    Reasoning/hybrid models like Kimi K2, DeepSeek V3.1, and GPT OSS 120B return thinking traces in the reasoning field of the ChatCompletionMessage object:
ChatCompletionMessage(content='Of course! The capital of France is **Paris**.', refusal=None, role='assistant', annotations=None, audio=None, function_call=None, tool_calls=[], reasoning="Hmm, the user is asking a straightforward factual question about the capital of France. This is a common knowledge query with a clear answer. \n\nI recall that Paris is the capital and has been for centuries, so I can confirm that directly. Since the question is simple, no additional context or elaboration is needed unless the user asks for more. \n\nI'll keep the response concise and accurate, just stating the answer with a brief mention of its historical status to add slight value without overcomplicating it.")

Currently, Oumi conversations consume only the content field of the ChatCompletionMessage object. This PR modifies the TogetherAI inference engine API to concatenate the reasoning and content fields of the ChatCompletionMessage into the content field of the output Oumi Conversation.
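The merge described above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: `merge_reasoning_into_content` is a hypothetical helper, and the `<think>` wrapping plus the `</think>` guard against already-embedded traces are assumptions based on the diff fragments in this conversation.

```python
def merge_reasoning_into_content(message: dict) -> str:
    """Prepend the reasoning trace, wrapped in <think> tags, to the content."""
    content = message.get("content") or ""
    reasoning = message.get("reasoning")
    # Skip the merge if there is no trace, or the content already embeds one.
    if not reasoning or "</think>" in content:
        return content
    return f"<think>{reasoning}</think>{content}"
```

For example, `merge_reasoning_into_content({"content": "Paris.", "reasoning": "Easy."})` yields a single string carrying both the trace and the answer, which is what a Conversation whose messages only have a content field can store.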

  2. Enable reasoning in the API
    Hybrid models like DeepSeek-V3.1 need "reasoning": {"enabled": True} to be passed in via the API input to turn on reasoning. This PR adds an optional api_kwargs field to RemoteParams so that users can pass in such arguments.

Related issues

Fixes # (issue)

Before submitting

  • This PR only changes documentation. (You can ignore the following checks in that case)
  • Did you read the contributor guideline Pull Request guidelines?
  • Did you link the issue(s) related to this PR in the section above?
  • Did you add / update tests where needed?

Reviewers

At least one review from a member of oumi-ai/oumi-staff is required.

@wizeng23
Contributor

wizeng23 commented Dec 2, 2025

I don't see any logic changes in this PR, only test changes.

)

@override
def _convert_conversation_to_api_input(

Per DRY, and to make sure this implementation doesn't diverge from the parent class over time, could you call super()._convert_conversation_to_api_input() and then add any modifications you need after that? Ditto for _convert_api_output_to_conversation.
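The pattern the reviewer is suggesting might look like this. Both class bodies are simplified stand-ins, not the actual oumi implementations; the point is only the delegate-then-augment shape.

```python
class RemoteInferenceEngine:
    """Simplified stand-in for the shared parent class."""

    def _convert_conversation_to_api_input(self, conversation: dict) -> dict:
        return {"messages": conversation["messages"]}


class TogetherInferenceEngine(RemoteInferenceEngine):
    """Subclass delegates to the parent, then layers on its own additions."""

    def _convert_conversation_to_api_input(self, conversation: dict) -> dict:
        api_input = super()._convert_conversation_to_api_input(conversation)
        # Provider-specific tweaks happen after the shared conversion, so the
        # base behavior cannot silently diverge from the parent over time.
        api_input["reasoning"] = {"enabled": True}
        return api_input
```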

@shanghongsim shanghongsim requested a review from wizeng23 December 3, 2025 00:09
api_kwargs: Optional[dict[str, Any]] = None
"""Additional keyword arguments to pass to the API.
This allows for passing any API-specific parameters that are not

Could you add a comment here that this is currently only used for the together inference engine?

Returns:
Conversation: The conversation including the generated response.
"""
try:

I don't quite get what this try-except is trying to do. If the try block fails, then super()._convert_api_output_to_conversation will also fail, because it will try to extract the message in the exact same way, right?

response, original_conversation
)

if "reasoning" in message and "</think>" not in message.get("content", ""):

Do you know if any other inference provider may include a "reasoning" field? If this is a common convention, we should move the logic in this function into the parent class RemoteInferenceEngine.


# Then layer on Together-specific / remote-specific kwargs
remote_params = self._remote_params
if remote_params.api_kwargs:

Ditto here; do you know of any other inference engines that could benefit from this? Should this logic be in RemoteInferenceEngine?

