Datasets collection and preprocessings framework for NLP extreme multitask learning
-
Updated
Jul 9, 2025 - Python
Datasets collection and preprocessings framework for NLP extreme multitask learning
A complete imitation learning pipeline for bar alignment using the UR5 robot in NVIDIA Isaac Sim. Includes manual data collection with a game controller, dataset organization for LeRobot, diffusion policy training, and policy deployment through ROS2.
Migrated to pyedmine
Script automation for Dataset Collection and Building with Tkinter GUI.
Automated drone flight system for collecting multiview images.
Techgium hackathon submission
A GUI for managing, visualizing, and analyzing competitive programming datasets with a PyQt6 GUI.
General-purpose Python image scraper using Selenium — download images by keyword for any ML training pipeline, dataset, or visual research.
自动搜集获取相关的视频,接管浏览器海量搜集,并自动判别。Automatically collects relevant videos, takes over bulk browser collection, and makes automatic judgments.
CLI-based Twitter (X) scraper built with Python for collecting tweet datasets for NLP tasks such as sentiment analysis, opinion mining, and user behavior analysis.
Add a description, image, and links to the dataset-collection topic page so that developers can more easily learn about it.
To associate your repository with the dataset-collection topic, visit your repo's landing page and select "manage topics."