demonzyj56
  • Nanyang Technological University

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 17,556 1,244 Updated Nov 27, 2025

Personal list of collected books …

17,064 1,746 Updated Dec 3, 2025

Curated list of data science interview questions and answers

5,313 1,212 Updated Sep 29, 2024

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,202 1,880 Updated Sep 8, 2025

Fully local web research and report writing assistant

Python 8,409 879 Updated Aug 8, 2025

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

Go 9,329 1,000 Updated Dec 17, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,301 327 Updated Dec 15, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,647 1,354 Updated Dec 17, 2025
Python 1,657 98 Updated Sep 30, 2025

A Python module to repair invalid JSON from LLMs

Python 4,181 161 Updated Dec 17, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,211 39 Updated Oct 4, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,534 2,987 Updated Dec 2, 2025

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 7,261 870 Updated Dec 17, 2025

Train transformer language models with reinforcement learning.

Python 16,695 2,366 Updated Dec 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,187 7,783 Updated Dec 18, 2025

This is a Python API that lets you get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles, and it requires neither an API key nor a headles…

Python 6,572 687 Updated Oct 13, 2025

The Unofficial TikTok API Wrapper In Python

Python 5,965 1,130 Updated Oct 14, 2025

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,024 580 Updated Apr 24, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,151 1,814 Updated Feb 26, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.

Jupyter Notebook 6,364 407 Updated Jun 28, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,943 2,207 Updated Dec 15, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 70,057 7,602 Updated Dec 18, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,258 1,446 Updated Nov 28, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,762 375 Updated Oct 21, 2025

VideoGen-Eval: Agent-based System for Video Generation Evaluation

253 14 Updated Dec 16, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,642 2,233 Updated Feb 1, 2025

Tarsier: a family of large-scale video-language models designed to generate high-quality video descriptions, with good general video understanding capability.

Python 507 28 Updated Aug 14, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 75,225 2,359 Updated Dec 19, 2025

A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumption

Rust 2,382 63 Updated Oct 20, 2025