Skip to content
View sweta20's full-sized avatar

Block or report sweta20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM Council works together to answer your hardest questions

Python 18,027 3,515 Updated Nov 22, 2025

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!

Python 1,835 345 Updated Mar 24, 2026

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,521 126 Updated Nov 13, 2025

Cell2Sentence: Teaching Large Language Models the Language of Biology

Jupyter Notebook 856 128 Updated Nov 4, 2025
Python 308 22 Updated Jul 15, 2024

A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.

Jupyter Notebook 170 18 Updated Aug 14, 2025
Jupyter Notebook 9 Updated Nov 5, 2022

LLM-based QAG framework for MT Evaluation

Python 4 1 Updated May 13, 2025

YSDA course in Natural Language Processing

Jupyter Notebook 10,549 2,747 Updated Feb 26, 2026

A framework for evaluating Machine Translation models.

Python 12 4 Updated Apr 21, 2026

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,710 59 Updated Apr 23, 2026

Toolkit used to collect translations from various online providers and LLMs

Python 13 3 Updated Sep 16, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,158 261 Updated Jun 6, 2024
JavaScript 69 8 Updated Jul 19, 2022

A benchmark with locally sourced multilingual questions for 31 languages.

Python 18 5 Updated Sep 2, 2025

Example competitions for the CodaLab project.

Python 27 26 Updated Apr 7, 2026
Python 5 Updated Jun 9, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 17,125 2,608 Updated Apr 30, 2026

Benchmark for evaluating open-ended generation

Python 51 7 Updated Nov 6, 2024

A curated list of research papers and resources on code-switching

337 40 Updated Jan 31, 2026

Quantifying Language Confusion in LLMs.

Jupyter Notebook 2 Updated Oct 17, 2024

Minimal set up to render text as images.

Python 1 Updated Sep 2, 2024

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 43 9 Updated May 22, 2025
Python 8 Updated Jan 27, 2026

Paper list for open-ended language generation

191 19 Updated Nov 17, 2022

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,198 6,065 Updated Aug 16, 2024

Generate text images for training deep learning ocr model

Python 1,461 388 Updated Jan 17, 2022

Render documents on a virtual paper with folds and other types of damage using blender geometry nodes.

Python 27 2 Updated Aug 14, 2023

A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation

Python 89 Updated Sep 27, 2025
Next