Stars
Talk to research papers like talking to authors - Python package with AI agent for arXiv papers
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
SWE-bench: Can Language Models Resolve Real-world Github Issues?
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
A Survey on Benchmarks of Multimodal Large Language Models
本仓库为公众号FinHack炼金术《从零开始卷量化》系列文章示例代码,带你零基础入门量化交易!
A curated list of MUD development resources, tools, and apps.
Diffusion model papers, survey, and taxonomy
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
ImageBind One Embedding Space to Bind Them All
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
✨✨Latest Advances on Multimodal Large Language Models
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
Developer APIs to Accelerate LLM Projects
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
A high-throughput and memory-efficient inference and serving engine for LLMs
Set of tools to assess and improve LLM security.