Stars
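This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in practice)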
newspaper3k is a news, full-text, and article metadata extraction library for Python 3. Advanced docs:
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
🏅 Collection of Kaggle Solutions and Ideas 🏅
Instruction Tuning with GPT-4
UI for your AI. Open Source Tailwind components tailored for your GPT, generative AI, and LLM projects.
Data Science Repo and blog for Johns Hopkins Coursera Courses. Please let me know if you have any questions.
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
OpenResearcher, an advanced Scientific Research Assistant
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.
A concise resource repository for machine learning
Website for "Real Deep Research for AI, Robotics and Beyond"
✅ Predicting LLM correctness from model internals
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages, ACL 2023
Computationally efficient methods for jointly generating natural language predictions with explanations