Stars
Easily turn your Click CLI into a powerful terminal application
Toolkit for linearizing PDFs for LLM datasets/training
pdbp (Pdb+): A drop-in replacement for pdb and pdbpp. To replace "pdb", add "import pdbp" to an "__init__.py" file.
Unimpeded: Convert Poe.com to OpenAI Interface-Compatible Format! 🔑 畅通无阻: 将 Poe.com 转换为 OpenAI 接口兼容格式!
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Object-recognition using multiple templates in python
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
A Python script to scan an answer sheet outputting the alternatives marked.
✨✨Latest Advances on Multimodal Large Language Models
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
UX-Decoder / X-Decoder
Forked from microsoft/X-Decoder[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
IQA: Deep Image Structure and Texture Similarity Metric
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
Vision Transformer for 3D medical image registration (Pytorch)
This is the implementation of the CDGPT2 model mentioned in our paper 'Automated Radiology Report Generation using Conditioned Transformers'
The code of Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation
Code for Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation (EMNLP-21)
A curated list of radiology report generation (medical report generation) and related areas. :-)
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
A pyQt interface to visualize NIfTI images as grided images.