theyorubayesian

Organizations

@castorini

Resources for those studying the fast-moving pace of AI policy

59 16 Updated Mar 26, 2026

Markdown memory system for you and your AI agent

Rust 1,124 51 Updated Jun 10, 2026

Machine Learning Systems

Python 24,922 2,997 Updated Jun 16, 2026

A curated list of LLM datasets for African languages.

7 1 Updated Jan 23, 2026

Library for fast text representation and classification. Fix compatibility with numpy 2

HTML 15 2 Updated Nov 21, 2024

OSS Investor relationship hub for founders

Go 59 9 Updated Mar 6, 2026

Adaptive Softmax implementation for PyTorch

Python 81 15 Updated May 4, 2019

Master's Thesis. Open source and LaTeX.

TeX 6 Updated Apr 13, 2022

LLM Eval leaderboard for African Languages

Jupyter Notebook 9 1 Updated Feb 27, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,390 3,436 Updated Jun 16, 2026
Python 8 2 Updated Oct 3, 2024
Jupyter Notebook 6 3 Updated Jan 29, 2026

Bringing BERT into modernity via both architecture changes and scaling

Python 1,690 146 Updated Mar 1, 2026

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 2,222 189 Updated Aug 26, 2025

Geographically-informed language identification

Python 7 1 Updated Mar 14, 2024

Whisperer provides an unopinionated approach for running multiple agents in Elixir

Elixir 28 1 Updated Dec 10, 2024

Benchmarking Large Language Models for FHIR

TypeScript 43 8 Updated Feb 4, 2026

Direct Preference Optimization from scratch in PyTorch

Python 130 12 Updated Apr 7, 2025

Tools for merging pretrained large language models.

Python 7,156 738 Updated Jun 13, 2026

Evolve LLM training instructions from English to any language.

Python 120 15 Updated Sep 15, 2023

Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code.

70 94 Updated Jun 13, 2026
HTML 130 20 Updated Sep 12, 2025

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.

755 47 Updated Jun 16, 2026

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 13,441 2,148 Updated Jun 15, 2026

IA's public Wayback Machine (moved from SourceForge)

Java 842 169 Updated Mar 1, 2024

MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.

Python 104 10 Updated Nov 16, 2025

stoplists for African languages generated from the ASP corpus

Ruby 14 4 Updated Jan 16, 2016

Source stories from the African Storybook Project in Markdown format

22 13 Updated Jan 25, 2026

maximal update parametrization (µP)

Jupyter Notebook 1,727 105 Updated Jul 17, 2024

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 3,008 202 Updated Jun 9, 2026