qiangqiang199

qiangqiang199

Achievements

Stars

opendatalab / WanJuan3.0

WanJuan3.0（“万卷·丝路”）一个作为综合性的纯文本语料库，采集了多个国家地区的网络公开信息、文献、专利等资料，数据总规模超1.2TB，Token总数超过300B，处于国际领先水平，首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成，每个子集的数据规模均超过150GB

42 1 Updated Feb 13, 2025

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,887 144 Updated Apr 14, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,646 749 Updated Sep 22, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,030 395 Updated Dec 25, 2025

InternLM / InternLM

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,131 501 Updated Oct 30, 2025

InternLM / MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 6,711 673 Updated Jul 4, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 51,004 4,236 Updated Dec 24, 2025

OpenDriveLab / DriveAGI

A Collection of Foundation Driving Models by OpenDriveLab

Python 783 33 Updated Jul 2, 2025

opendatalab / WanJuan2.0-WanJuan-CC

WanJuan-CC是以CommonCrawl为基础，经过数据抽取，规则清洗，去重，安全过滤，质量清洗等步骤得到的高质量数据。

14 Updated Apr 18, 2024

JourneyDB / JourneyDB

180 5 Updated Nov 14, 2025

greshake / llm-security

New ways of breaking app-integrated LLMs

Jupyter Notebook 2,028 140 Updated Jul 17, 2025

opendatalab / MLLM-DataEngine

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Python 48 5 Updated May 24, 2024

opendatalab / VIGC

AAAI 2024: Visual Instruction Generation and Correction

Python 95 3 Updated Feb 4, 2024

opendatalab / WanJuan1.0

万卷1.0多模态语料

569 28 Updated Oct 20, 2023

fudan-zvg / Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Python 2,296 143 Updated Jun 7, 2023

PJLab-ADG / 3DTrans

An open-source codebase for exploring autonomous driving pre-training

Python 567 66 Updated Jan 19, 2024

BAI-Yeqi / OpenPCSeg

OpenPCSeg: Open Source Point Cloud Segmentation Toolbox and Benchmark

Python 474 46 Updated Apr 27, 2025

opendatalab / labelU

Data annotation toolbox supports image, audio and video data.

Python 1,448 159 Updated Oct 1, 2025

opendatalab / dsdl-sdk

Jupyter Notebook 13 6 Updated May 29, 2024

opendatalab / labelU-Kit

Data annotation component library --provided as NPM packages

TypeScript 141 46 Updated Nov 19, 2025

opendatalab / opendatalab-datasets

datasets resource

127 13 Updated Jul 1, 2025

opendatalab / dsdl-docs

Data Set Description Language Specification （新一代人工智能数据集描述语言DSDL）

HTML 47 6 Updated May 29, 2024

opendatalab / opendatalab-python-sdk

SDK of OpenDataLab - https://opendatalab.org.cn

Python 58 5 Updated Jul 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

qiangqiang199

Achievements

Achievements

Block or report qiangqiang199

Stars

opendatalab / WanJuan3.0

opendatalab / DocLayout-YOLO

OpenGVLab / InternVL

InternLM / xtuner

InternLM / InternLM

InternLM / MindSearch

opendatalab / MinerU

OpenDriveLab / DriveAGI

opendatalab / WanJuan2.0-WanJuan-CC

JourneyDB / JourneyDB

greshake / llm-security

opendatalab / MLLM-DataEngine

opendatalab / VIGC

opendatalab / WanJuan1.0

fudan-zvg / Semantic-Segment-Anything

PJLab-ADG / 3DTrans

BAI-Yeqi / OpenPCSeg

opendatalab / labelU

opendatalab / dsdl-sdk

opendatalab / labelU-Kit

opendatalab / opendatalab-datasets

opendatalab / dsdl-docs

opendatalab / opendatalab-python-sdk