# Starred repositories
7 stars written in HTML
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
Instruction Tuning with GPT-4
Learn Chinese on the go - no Internet connection required!
A lightweight script for converting HTML pages to Markdown, with support for code blocks