Starred repositories
An interoperable Python framework for biomolecular simulation.
This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.
Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
Chai-1, SOTA model for biomolecular structure prediction
A comprehensive benchmark on the performances of multiple protein backbone generative models.
Official repository for the Boltz-1 biomolecular interaction model
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
This is a repo with links to everything you'd ever want to learn about data engineering
Vector (and Scalar) Quantization, in Pytorch
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Data release for the ImageInWords (IIW) paper.
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.
A Benchmark for Efficient and Compositional Visual Reasoning
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Open Source Implementation of the Unique Ring Families Algorithm (Cheminformatics)
Official Repository of "SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery" (ECCV 2024)
DeepDrugDomain: A versatile Python toolkit for streamlined preprocessing and accurate prediction of drug-target interactions and binding affinities, leveraging deep learning for advancing computati…
Network-Oriented Repurposing of Drugs Python Package
O1 Replication Journey: A Strategic Progress Report – Part I
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
LLaVA-O1: Open Large Reasoning MLLMs Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace