-
Massachusetts Institute of Technology
- Cambridge, MA
- https://nsidn98.github.io/
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Multiagent research environment toolbox based on Unreal Engine
[arXiv 2023] Embodied Task Planning with Large Language Models
music21 is a Toolkit for Computational Musicology
Official repository for our work on micro-budget training of large-scale diffusion models.
[NeurIPS 2024] Official code repository for MSR3D paper
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
Code used in our NeurIPS 2022 paper 'AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments'
Fast and flexible multi-agent gridworld reinforcement learning environments.
A generative and self-guided robotic agent that endlessly propose and master new skills.
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
An open source framework for research in Embodied-AI from AI2.
Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
📱👉🏠 Perform conditional procedural generation to generate houses like your own!
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
A high-throughput and memory-efficient inference and serving engine for LLMs
API to run VirtualHome, a Multi-Agent Household Simulator
A Structured Output Framework for LLM Outputs
Utility functions when working with Ai2-THOR. Try to do one thing once.
Generating Robotic Simulation Tasks via Large Language Models
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Repo for reproduction of sequential social dilemmas
[ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can…