- Bandung, West Java
- https://ardava-barus.netlify.app/
- in/ardava-barus
- rdavaa_
Stars
Illegal, unreported, and unregulated (IUU) fishing is a major threat to the long-term sustainability of the fishing industry and to ocean health. By using semi-supervised learning, I developed an Anoma…
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
Lime: Explaining the predictions of any machine learning classifier
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
PyTorch code and models for VJEPA2 self-supervised learning from video.
Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
[NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed
[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
Give your agents the power of the Hugging Face ecosystem
[CVPR 2019, Oral] "Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images" by Wuyang Chen*, Ziyu Jiang*, Zhangyang Wang, Kexin Cui, and Xiaoning Qian
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
[CVPR 2026] Official Code for "UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders"
🐍 Geometric Computer Vision Library for Spatial AI
Visualize PyTorch tensors with a single line of code.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
MCP Server for Computer Use in Windows
Equirk is a Web3 platform helping people with disabilities find inclusive jobs and personalized skill paths through accessible and transparent matching.
Audioscope AI is a web-based platform that uses machine learning and LLMs to analyze respiratory sounds for lung disease detection
Tries for efficient automatic word completion in Python, C++, Ruby & Java.
A Model Context Protocol server for searching and analyzing arXiv papers
Baseline localization and classification models for the xView 2 challenge.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
A new collection of medical VQA datasets based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images'. (NeurIPS 2023 …
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
Medical image captioning using OpenAI's CLIP