A Java-based search engine that searches data from a database using various data structures.
Transform HTML into absolutely any format imaginable.
MERN Ecommerce Carpets Shop (Front-end)
Code and data for SORE (ACL 2025), a semantic boilerplate remover.
A console-based web search engine developed in Java.
A Scrapy-based web scraper for collecting Kurdish text data from websites. The tool recursively crawls specified domains, extracts article content using Trafilatura, and filters results by language using Facebook's FastText language identification model.
Shift noisy web pages into clean, context-ready text for LLMs — Rust library & MCP server
This web search engine is an attempt to build a mini version of popular search engines such as Google, Bing, or YouTube. It is built on various data structures so that it runs efficiently and returns accurate results. First, we collected the web pages with a web crawler written in Python. The web craw…
CTX (Context Transfer Format) — universal interchange format for LLM web content consumption
I'm an aspiring Full Stack Developer specializing in the MERN stack (React.js, Next.js, Node.js, PostgreSQL, Prisma ORM). I build real-world projects with clean, efficient code, focusing on modern UI/UX and robust backend solutions. Proficient in JavaScript, Python, and Java.
A simple utility to convert HTML into text, keeping as much content as possible
This program takes a *.mbox or *.txt file containing emails downloaded with Google Takeout, extracts the relevant information, and writes it to a *.txt file.
Python library for converting HTML to markup or plain text
Standalone .NET converter library for PDF, DOCX, XLSX, HTML, image, CSV, RTF, and TXT files in the .NET Framework. It requires neither Adobe Acrobat components nor Microsoft Office Interop assemblies.