Skip to content
View ScienceOne-AI's full-sized avatar

Block or report ScienceOne-AI

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 216 14 Updated Jun 21, 2024

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 693 42 Updated Nov 4, 2025

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Python 1,486 87 Updated Oct 24, 2025

磐石科学基础大模型API

Python 4 Updated Jul 29, 2025

AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead of always thinking or never thinking, the model learns when …

Python 42 3 Updated Oct 14, 2025

Train your Agent model via our easy and efficient framework

Python 1,606 149 Updated Nov 3, 2025

[AAMAS'25] Code for "Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model"

Python 10 2 Updated May 27, 2025

[AAAI'25] Code for "In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning"

Python 9 1 Updated Jul 17, 2025

Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs

Python 16 Updated Mar 20, 2025

This repository implements AutoThink in our paper: Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL

Python 10 Updated May 20, 2025

DeepLiterature: A fully open-source intelligent research assistant that integrates search, code execution, link resolution, and information expansion, with multiple tools working together to facili…

Python 94 8 Updated Mar 19, 2025

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…

Python 775 94 Updated Mar 13, 2025

Genome modeling and design across all domains of life

Jupyter Notebook 3,177 365 Updated Sep 17, 2025