Skip to content
View gsajko's full-sized avatar

Block or report gsajko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Let's all work together on a 3d scanner benchmark for desktop 3d scanners

Python 143 1 Updated Jun 13, 2026

Classroom-ready open-source educational exoskeleton for biomedical and control engineering

TeX 6 4 Updated Jan 30, 2025

A repository for research on medium sized language models.

Python 537 79 Updated Jun 6, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,228 507 Updated Oct 30, 2025

Twitter Scraper

Python 658 93 Updated Mar 22, 2026

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Python 221 24 Updated Apr 29, 2024
Python 196 11 Updated May 5, 2024

auto fine tune of models with synthetic data

Python 78 5 Updated Feb 14, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

80,297 9,361 Updated Feb 5, 2026

Minimalistic large language model 3D-parallelism training

Python 2,725 318 Updated May 26, 2026

structured outputs for llms

Python 13,208 1,090 Updated Jun 23, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,694 729 Updated Jun 23, 2026
Jupyter Notebook 339 53 Updated Jun 26, 2023

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 42,690 8,448 Updated Jun 10, 2026

Scripts to create a basic search on podcast data in general

Python 10 1 Updated Dec 23, 2022

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Python 876 66 Updated Jun 16, 2023

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,427 166 Updated Apr 17, 2025

Next.js app for serverless deployments of OpenAI Whisper on Banana.dev

JavaScript 96 35 Updated Sep 22, 2022

AI-powered CLI tool to help you remember bash commands.

Rust 333 16 Updated Jul 6, 2024

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

Jupyter Notebook 245 38 Updated Sep 12, 2022

An Obsidian.md plugin to save tweets as Markdown files.

TypeScript 220 16 Updated May 8, 2023

Supporting materials/code examples for my course in data engineering for machine learning.

Python 39 6 Updated Nov 15, 2022

Resumes generated using the GitHub informations

JavaScript 62,861 1,369 Updated Feb 15, 2023

An underground, wireless, open-source, low-cost system for monitoring oxygen, temperature, and soil moisture

C++ 7 Updated Nov 19, 2021

Free MLOps course from DataTalks.Club

Jupyter Notebook 14,840 2,972 Updated Jun 10, 2026

Building a real-time twitter graph of your friends

C# 265 14 Updated May 15, 2022

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 5,176 424 Updated Jun 2, 2026

Repo for Ecosystem Creator project based on Synthetic Silviculture Paper

C++ 4 Updated Nov 2, 2021

a cheat-sheet for mathematical notation in code form

15,482 1,089 Updated Mar 8, 2022

Question Generation - Question Answering for Automatic Flashcards

JavaScript 66 5 Updated Mar 14, 2022
Next