Skip to content
View Albert-Ma's full-sized avatar

Organizations

@ict-bigdatalab

Block or report Albert-Ma

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repo for paper "Agentic-R: Learning to Retrieve for Agentic Search" (ACL 2026 Findings)

Python 87 3 Updated Apr 9, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,610 577 Updated Jun 13, 2026

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 422 18 Updated Jan 29, 2026

Fully open reproduction of DeepSeek-R1

Python 26,309 2,437 Updated Apr 2, 2026

Repo for paper "ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability" (ACL 2026 Main)

Python 178 9 Updated Apr 9, 2026

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

140,420 34,673 Updated Jun 12, 2026

Model Context Protocol Servers

TypeScript 87,201 10,999 Updated Jun 7, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,376 1,044 Updated Jun 4, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,968 1,055 Updated May 7, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,967 326 Updated Jan 14, 2026

Collection of leaked system prompts

14,652 2,084 Updated Jun 12, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,706 1,058 Updated Apr 30, 2026

This is the code of MMOA-RAG.

Python 110 8 Updated May 11, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,917 3,489 Updated Jun 13, 2026

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 36,541 5,161 Updated Jun 14, 2026

MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]

Python 27 2 Updated Nov 3, 2024
Python 40 3 Updated Apr 6, 2026

The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.

590 35 Updated Jul 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,822 18,041 Updated Jun 14, 2026

JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios…

Python 63 6 Updated Dec 13, 2024

Code and data repository for two papers (ACL & EMNLP 2024) on the topic of collapse in model editing.

Python 9 1 Updated Dec 20, 2024

Grok open release

Python 51,690 8,478 Updated Aug 30, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 29,097 2,974 Updated Apr 9, 2026

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,621 103 Updated Dec 20, 2025

The official PyTorch implementation of Google's Gemma models

Python 5,692 599 Updated May 30, 2025

Code for AAAI 2024 paper Wikiformer

Python 20 Updated Dec 21, 2023

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,600 99 Updated Apr 17, 2026

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,174 935 Updated Mar 11, 2025

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,948 372 Updated Jun 3, 2026
Next