Standalone .NET converter library that requires neither Adobe Acrobat components nor Microsoft Office Interop Assemblies, for converting PDF, DOCX, XLSX, HTML, image, CSV, RTF, and TXT files in the .NET Framework
Updated Nov 5, 2018 - C#
A Java-based search engine that searches data from a database using various data structure concepts.
This web search engine was an attempt to build a mini version of popular web search engines such as Google, Bing, or YouTube. It uses various data structures to return accurate results efficiently. First, we collected the web pages using a web crawler written in Python. The web craw…
A console-based web search engine developed in Java.
MERN Ecommerce Carpets Shop (Front-end)
This program takes a *.mbox or .txt file containing emails downloaded with Google Takeout, processes it to gather the relevant information, and delivers the results as a *.txt file.
A simple utility to convert HTML into text, keeping as much content as possible
I'm an aspiring Full Stack Developer specializing in the MERN stack (React.js, Next.js, Node.js, PostgreSQL, Prisma ORM). I build real-world projects with clean, efficient code, focusing on modern UI/UX and robust backend solutions. Proficient in JavaScript, Python, and Java.
Code and data for SORE (ACL 2025), a semantic boilerplate remover.
Python library for converting HTML to markup or plain text
A Scrapy-based web scraper for collecting Kurdish text data from websites. The tool recursively crawls specified domains, extracts article content using Trafilatura, and filters results by language using Facebook's FastText language identification model.