Nest Digital
- LUCKNOW, UTTAR PRADESH, INDIA
- https://atrisaxena.github.io
Starred repositories
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Fine tune Gemma 3 on an object detection task
Includes in-depth documentation for learning LangChain for LLM development
A project to develop a self-driving car with the CARLA simulator and the ROS 2 CARLA bridge
A happy and lightweight Python package that provides an API to search for articles on Google News and returns a JSON response.
Python tool for converting files and office documents to Markdown.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Let's make video diffusion practical!
Integration of AutoWare AV software with the CARLA simulator
Open-source, low-latency key/value engine built on Valkey with query subscriptions and hierarchical storage tiers.
A Conversational Speech Generation Model
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
A library for mechanistic interpretability of GPT-style language models
SpeechGPT Series: Speech Large Language Models
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…
This is my bachelor's thesis, which contains three main features: lane detection, road segmentation, and a Forward Collision Warning (FCW) system
Development of Deep Learning algorithms for Drivable Area Segmentation, Lane Segmentation, Traffic Sign Detection and Classification with data collected and labeled by Ford Otosan.
Fast correction of fisheye distortion in an image.
A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Official Code for DragGAN (SIGGRAPH 2023)
Python bindings for the Transformer models implemented in C/C++ using the GGML library.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
The official PyTorch implementation of L2CS-Net for gaze estimation and tracking
Cross-platform, customizable ML solutions for live and streaming media.