
The API to search, scrape, and interact with the web at scale. 🔥

TypeScript 136,199 7,905 Updated Jun 21, 2026

Ain sport, a library for online action spotting in soccer game videos.

Python 2 1 Updated Sep 4, 2023

Metadata extraction and validation in scientific papers

Python 14 3 Updated Feb 10, 2026

Synthetic data curation for post-training and structured data extraction

Python 1,688 142 Updated Jun 21, 2026

A Python package for generating sequences (greedy and beam search) from PyTorch models, not necessarily HF transformers.

Python 18 2 Updated Dec 12, 2025

JAX - A curated list of resources https://github.com/google/jax

2,132 171 Updated Jan 20, 2026

Minimal library to train LLMs on TPU in JAX with pjit().

Python 299 38 Updated Jun 2, 2026

LLM training code for Databricks foundation models

Python 4,413 588 Updated Mar 25, 2026

Implementations of many Arabic NLP and CV projects, providing a real-time experience through several interfaces such as web, command line, and notebooks.

JavaScript 421 48 Updated Mar 1, 2024

Arabic Tokenization Library. It provides many tokenization algorithms.

Jupyter Notebook 111 21 Updated Jan 4, 2024

The dataset and code for the paper "TheoremQA: A Theorem-driven Question Answering dataset"

Python 161 9 Updated Apr 23, 2024

Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!

Python 9,317 779 Updated Jun 17, 2026

Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.

Jupyter Notebook 46 8 Updated Apr 3, 2025

Ya'rob I'rab Arabic Inflection

Python 41 3 Updated Sep 2, 2024

A data preprocessor for the Quranic Treebank using neural networks. Divides longer verses into smaller chunks.

Python 12 3 Updated Jul 4, 2023

Hey 👋, Glad to see you here! Check out this repository to learn more about me 🤓. You can also use it to make your awesome GitHub README ✨ (Don't Just Fork, Star Too 😅)

270 269 Updated Jun 20, 2026

Python 16 2 Updated Aug 22, 2023

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Python 321 14 Updated Jun 3, 2024

Transcribe text and create SRT and VTT files using Whisper models and wit.ai technology.

Python 141 17 Updated May 3, 2026

Maha is a text processing library specially developed to deal with Arabic text.

Python 216 19 Updated May 25, 2026

Several deep learning models for restoring Arabic diacritics using PyTorch.

Python 38 12 Updated Apr 14, 2022

Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Python 463 53 Updated Nov 5, 2022

This directory gathers the tools developed by the Data Sourcing Working Group

Python 31 6 Updated Oct 25, 2021

Toolkit for creating, sharing and using natural language prompts.

Python 3,025 379 Updated Oct 23, 2023

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,970 3,506 Updated Jun 19, 2026

A set of functionalities that enables Arabic website developers to provide professional search and to present and process Arabic content in PHP

PHP 336 58 Updated Oct 4, 2025

End-to-end Arabic TTS system based on Tacotron

Python 127 36 Updated Apr 5, 2024