Skip to content
View archiki's full-sized avatar

Organizations

@csalt-research

Block or report archiki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,488 32,896 Updated Apr 17, 2026

Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"

Python 17 3 Updated Mar 2, 2026

A curated list of research papers and resources on code-switching

335 40 Updated Jan 31, 2026

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,201 6,674 Updated Sep 30, 2025

Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"

Python 14 Updated Sep 19, 2025
Python 164 38 Updated Jul 16, 2025
Python 9 3 Updated Jun 16, 2025

Reasoning by Communicating with Agents

Python 29 5 Updated Apr 29, 2025

List of AI Residency Programs

3,285 271 Updated Apr 4, 2025
Python 323 14 Updated Sep 18, 2024

Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"

Python 25 1 Updated Sep 11, 2024

PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models

Python 25 3 Updated Jul 22, 2024
Python 10 1 Updated Mar 5, 2024

natual language guided image captioning

Python 87 7 Updated Feb 11, 2024

MEND: Fast Model Editing at Scale

Python 259 33 Updated Aug 30, 2023

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,886 318 Updated Mar 14, 2023

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Python 652 110 Updated Jan 4, 2023

Speech Recognition using DeepSpeech2.

Python 2,140 625 Updated Dec 13, 2022

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

Python 28 2 Updated May 2, 2022