Skip to content
View sweta20's full-sized avatar

Block or report sweta20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM Council works together to answer your hardest questions

Python 16,897 3,354 Updated Nov 22, 2025

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!

Python 1,814 347 Updated Mar 24, 2026

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,503 125 Updated Nov 13, 2025

Cell2Sentence: Teaching Large Language Models the Language of Biology

Jupyter Notebook 847 123 Updated Nov 4, 2025
Python 308 23 Updated Jul 15, 2024

A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.

Jupyter Notebook 163 16 Updated Aug 14, 2025
Jupyter Notebook 7 Updated Nov 5, 2022

LLM-based QAG framework for MT Evaluation

Python 4 1 Updated May 13, 2025

YSDA course in Natural Language Processing

Jupyter Notebook 10,529 2,748 Updated Feb 26, 2026

A framework for evaluating Machine Translation models.

Python 12 4 Updated May 26, 2025

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,684 57 Updated Mar 9, 2026

Toolkit used to collect translations from various online providers and LLMs

Python 13 3 Updated Sep 16, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,148 261 Updated Jun 6, 2024
JavaScript 68 8 Updated Jul 19, 2022

A benchmark with locally sourced multilingual questions for 31 languages.

Python 18 4 Updated Sep 2, 2025

Example competitions for the CodaLab project.

Python 27 26 Updated Apr 7, 2026
Python 5 Updated Jun 9, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 16,975 2,583 Updated Apr 9, 2026

Benchmark for evaluating open-ended generation

Python 51 7 Updated Nov 6, 2024

A curated list of research papers and resources on code-switching

334 40 Updated Jan 31, 2026

Quantifying Language Confusion in LLMs.

Jupyter Notebook 2 Updated Oct 17, 2024

Minimal set up to render text as images.

Python 1 Updated Sep 2, 2024

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 43 9 Updated May 22, 2025
Python 8 Updated Jan 27, 2026

Paper list for open-ended language generation

191 19 Updated Nov 17, 2022

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,017 6,034 Updated Aug 16, 2024

Generate text images for training deep learning ocr model

Python 1,461 388 Updated Jan 17, 2022

Render documents on a virtual paper with folds and other types of damage using blender geometry nodes.

Python 26 2 Updated Aug 14, 2023

A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation

Python 87 Updated Sep 27, 2025
Next