- Earth
Highlights
research
AndroidWorld is an environment and benchmark for autonomous agents
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo
Parallel Hyperparameter Tuning in Python
Azure Bicep/ARM template to quickly deploy standalone secure research environments following the architecture published at https://docs.microsoft.com/azure/architecture/example-scenario/ai/secure-c…
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
[ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models [NAACL 2025]
Official inference framework for 1-bit LLMs
This repository contains code to generate and preprocess Learning with Errors (LWE) data and implementations of four LWE attacks uSVP, SALSA, Cool&Cruel, and Dual Hybrid Meet-in-the-Middle (MitM). …
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Janus-Series: Unified Multimodal Understanding and Generation Models
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Run PyTorch LLMs locally on servers, desktop and mobile
Port of OpenAI's Whisper model in C/C++
Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
Native Ubuntu installations for Apple silicon hardware
Open source code for AlphaFold 2.
This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.
Jenetics - Genetic Algorithm, Genetic Programming, Grammatical Evolution, Evolutionary Algorithm, and Multi-objective Optimization