Starred repositories
(ICCV 2025) UAVScenes: A Multi-Modal Dataset for UAVs
[NeurIPS 2021] LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation
PyTorch implementation of UNet++ (Nested U-Net).
Monocular Lane Detection Based on Deep Learning: A Survey
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
The mission of the project is to build an agricultural robot (AgriBot) from scratch with the aim of serving as a data-recording platform in fields. For further information about the design and purp…
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
A Procedural World Generator for Robotics Simulation of Agricultural Tasks
Extrinsic Calibration of a Camera and 2d Laser
Pseudo Streaming SenseVoice with Hotwords
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理
Implementacíon de modelo OpenAI Whisper tiny afinado en español para ser ejecutado en chips con NPU Rockchip
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Efficient Inference of Transformer models
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
PyTorch implementation for Sparse LaneFormer
This repository contains an attempt to make a visualization module similar to a Tesla dashboard
Agronav: Autonomous Navigation Framework for Agricultural Robots and Vehicles using Semantic Segmentation and Semantic Line Detection
Lane Detection in Low-light Conditions Using an Efficient Data Enhancement : Light Conditions Style Transfer (IV 2020)
(CVPR21/ECCV20 Workshops) Official repo: Virtual Image Dataset for Illumination Transfer (VIDIT)
[ICCV2023 Oral] LATR: 3D Lane Detection from Monocular Images with Transformer