Browse free open source Text Processing software and projects below. Use the toggles on the left to filter open source Text Processing software by OS, license, language, programming language, and project status.

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 1
    Scribus

    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Leader badge
    Downloads: 13,983 This Week
    Last Update:
    See Project
  • 2
    TeXworks

    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms particularly GNU/Linux and Windows and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 116 This Week
    Last Update:
    See Project
  • 3
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 62 This Week
    Last Update:
    See Project
  • 4
    The Guide
    The Guide is a tree-based information management tool. It lets you to organize information as nodes in a tree. (A two-pane rich-text outliner for Windows.)
    Downloads: 40 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Calc2LaTeX is an OpenOffice.org Calc (Spreadsheet) macro for converting tables. It makes making tables on LaTeX very easy.
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 6

    Bulgarian language support

    Spell check, grammar check and hyphenation for Bulgarian language

    The goal of this project is to provide spell check, grammar check and hyphenation for Bulgarian language for Open Source products such as OpenOffice.org, LibreOffice, TeX, aspell, ispell, hunspell etc.
    Leader badge
    Downloads: 37 This Week
    Last Update:
    See Project
  • 7
    CONVERTCP

    CONVERTCP

    Text File Codepage Converter for the Windows command line

    This command line utility is a codepage converter to be used to change the character encoding of text. It fully supports charsets such as ANSI code pages, UTF-8, UTF-16 LE/BE, UTF-32 LE/BE, and EBCDIC. It's designed to convert big text files, too. It runs on Windows XP onwards (tested on XP, Windows 7, Windows 8.1, Windows 10, and Windows 11). The "readme.txt" file and the Wiki gives you some more information. You'll find the compiled tool for 32 bit (x86) and 64 bit (x64) Windows in the "bin" directory. The C source code is available in the "src" directory. Just click on the "Files" tab. Regardless if you have or don't have a SourceForge account - whenever you have questions about CONVERTCP or you want to give feedback then you are welcome to post it in the forum. Click on the "Discussion" tab.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    MarkDown Reader LT

    MarkDown Reader LT

    Lightning fast markdown reader for Windows

    A tiny MarkDown reader for Windows View your MarkDown files in style! * Lightining fast loading & conversion * Renders pages using Bootstrap CSS * Automatically creates html files * Absolutely free! #What is MarkDown? MarkDown is a document format that allows you to write using an easy-to-read, easy-to-write plain text format. Markdown is the opposite of markup, because markup (HTML etc.) makes your contents un-readable, un-editable. On the other hand, Markdown encourages creation of great content using so simple rules. Yet, in order to convert those rules in to presentable documents, you need a MarkDown parser. This is where the MarkDown Reader LT comes in! MarkDown Reader setup associates all .md files with markdown reader, so it is really easy to view your markdown files using double click.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 9
    Perpetual Notes

    Perpetual Notes

    Write beautifully. Organize easily. Find everything.

    Take notes faster. Find information easily. Save notes in RTF with rich text formatting and images, meeting notes, web pages, projects, travel plan, research drafts - with Perpetual Notes as your note taking app, have fun with note taking again. Runs on Windows 7/8/10.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 11
    PTools is a set of useful tools written in Pascal. It includes: scientific calculator, archiver, text editor, remote adminitration and more. It is designed to be portable across operating systems, specially Java-based mobiles, Windows and Unixes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    TextExtractor

    TextExtractor

    Extracts plain text from a variety of different file types

    TextExtractor extracts plain text from hundreds of different file types, storing the text extracted in suitably named text files. TextExtractor 1.10 works in six different modes :- Instant Mode - Just select any file and extract the text from it. Batch Mode - Select a group of files and extract the text from all of them in one go. Polling Mode - Watch a folder location, processing new files as they appear there. Hierarchical Mode - Extract Text from files in a directory hierarchy. File List Mode - Extract Text from files in a list. File Viewer - Select individual files from a file tree to see their textual content.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    JODReports is a solution for generating dynamic documents and reports in Java based on the OpenDocument format (ODF). Templates can be easily composed with a word processor such as OpenOffice.org Writer. Data sources include POJOs and XML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14

    MindRaider

    MindRaider is a personal notebook and outliner.

    MindRaider is a personal notebook and outliner. Where do you keep private remarks like ideas, plans, gift tips and howtos? Loads of documents and remarks spread around the file system? Can you find a remark when you need it? No? Try MindRaider!
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Dictionary program for Palm.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    The aim of this project is to provide exportfilters and extensions that enable the popular Star/OpenOffice Suite to be used as a single source publishing application. Initial target formats are PDF and HTML customized for Print, Web and Presentations.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    References bibliographic software
    Bibliographic software supporting LaTeX/BibTeX, LibreOffice, OpenOffice and other word processors
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Gyrfalcon is a note / thought / task management system. Take your notes and other bits of information and: put the notes in hierarchical trees, tag, search, hyperlink, etc. Gyrfalcon is also designed with a clean interface that avoids modal interactions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    J2ME Memopad is a simple MIDP application designed to allow storage and retrieval of notes. It will have the ability to search and generate a list of results, as well as categorize your memos. The basic design of the memopad is similar to the Palm.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    JReferences is a tool to store and retrieve bibliographic references from a file or MySQL database. It reads BibTeXML, DocBook XML and RIS type references, and can output these and BibTex. A bibtex like alternative is also provided for DocBook XML docu
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Xelem is a compact Java-library to read and write Excel files of type SpreadsheetML. It can produce sophisticated, intricate and complex spreadsheets from within any Java program. And, since the release of xelem.2.0, it can read xml-spreadsheets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    PageEditor

    PageEditor

    A free, fast and easy-to-use word processor app.

    PageEditor is a free, fast and easy-to-use word processor app with can read, edit and save text files. It also can edit PDF files and have a integrated web browser.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Xneur is a "hot" text corrector for different languages. It's corrects phonetic and layout errors when you are writing text on the X server.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next