Skip to content
@DataArcTech

DataArcTech

Welcome to DataArc Tech Inc.

⚡DataArcTech⚡

👉 Data-Driven, Intelligently Synthesized

🔥 We specialize in intelligent synthetic data generation and knowledge-augmented LLM reasoning technologies.

🌟 With a focus on context graphs and multi-agent systems, we build more efficient and trustworthy next-generation data and model infrastructure.

🚀 Through open-source projects and in-depth research, we explore the full technical cycle from data synthesis and continual pre-training to model evaluation.

👋 Join us in contributing high-quality algorithms, data, and insights to the open-source community.

         

Popular repositories Loading

  1. ToG ToG Public

    This is the official github repo of Think-on-Graph (ICLR 2024). If you are interested in our work or willing to join our research team in Shenzhen, please feel free to contact us by email (xuchengj…

    Python 609 69

  2. DataArc-SynData-Toolkit DataArc-SynData-Toolkit Public

    Synthetic Data Generation Platform By DataArcTech

    Python 284 5

  3. LLM-as-a-Judge LLM-as-a-Judge Public

    162 5

  4. SQL-R1 SQL-R1 Public

    [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"

    Python 115 16

  5. ToG-2 ToG-2 Public

    Python 100 17

  6. ChartMoE ChartMoE Public

    [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

    Jupyter Notebook 94 8

Repositories

Showing 10 of 26 repositories

Top languages

Loading…

Most used topics

Loading…