Browse free open source Text Processing software and projects below. Use the toggles on the left to filter open source Text Processing software by OS, license, language, programming language, and project status.

  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 1
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file, find filenames fast using index files, rename many files using expressions, copy part of a file, change times of a file, set file time from filename, print colored text to terminal, convert csv to tab separated, download files from web, send http or udp requests, print tcp or udp traffic, create hexdump of files, join many text files into one, list nested .zip .tar .tar.gz .tar.bz2 archive contents. Fully portable, for USB stick, without installation. For Windows, Linux and Mac OS/X.
    Leader badge
    Downloads: 630 This Week
    Last Update:
    See Project
  • 2
    Midnight Commander for Windows

    Midnight Commander for Windows

    Native Midnight Commander 4.8.33 for Windows/Win32

    Windows XP+/32 bit native port of GNU Midnight Commander, based on the current 4.8.33 development stream. Midnight Commander (also known as mc) is a free cross-platform orthodox file manager and a clone of Norton Commander. Supports both the recent Windows 10 enhanced plus legacy consoles, 16 and 256+ colour modes are available within either. Features include the ability work with common archive formats as if they were simply another directory, and function as an SFTP/FTP client. Also includes an built-in editor/viewer, features include syntax highlighting for many languages, macros, code snippets, simple integration with external tools, automatic indentation, mouse support, clipboard and the ability to work in both ASCII and hex modes. MC can also rename groups of files, move files to a different directory at the same time as it.renames them. It lets the user specify the original and resulting file names using wildcard. https://github.com/adamyg/mcwin32
    Leader badge
    Downloads: 272 This Week
    Last Update:
    See Project
  • 3
    PTools is a set of useful tools written in Pascal. It includes: scientific calculator, archiver, text editor, remote adminitration and more. It is designed to be portable across operating systems, specially Java-based mobiles, Windows and Unixes.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Drag-and-drop files/directories/HTML-URLs into a Java GUI. Perform text operations on the files into output files. Operations include concatention, text and regex editing, and other file/string/row/column/script operations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Moved to http://code.google.com/p/evil-programmers/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    GitSync is a shell script designed to simplify the usage of the version control system GIT (see www.git-scm.com for more information) by providing a "do everything to sync my repository" command.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Visual xsltproc is a tool which help to write xslt file, and debug it to find errors. It writes xml, and generates xml (Syntax highlighting of XML & line Nr.). Finally if the result is XSL-FO it generates the pdf on Apache FOP java. Build on QT4.2.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    ZipDB is a program that allows databases to be made easily and efficiently using compressed zip files. An easy-to-use gui is the key feature allowing quick access to all files in the archive, and since the archives are .zip they can always be opened on
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    A knowledgment management system written in Java under JBoss 4.2.3 Server, with richfaces 3.3.0BETA4. Including fileconversion from html to pdf and rich:editor component without special syntaxing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A backup/transfer files program to move and process files between computers in a production (industrial) process. The processing of file begins when it are created or copied to directory and they can be redirected to other machine to be used.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CTA (conversor de ficheros de texto) is a program to change the format of one or multiple text files between unix text file format and dos/windows text file format. With this easy program you can see text files correctly, whitout annoying symbols or comp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Data monkey was to be a powerful windows data translation and conversion application to convert complex text file formats, database data, data from custom devices into other formats. I never finished it and was hoping someone would take it over.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The FileNamer for EML allows you to rename a lot of .eml files like 001.eml, 002.eml, 003.eml to something more descriptive.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    This program is used as a tool to check for ASCII code and strip it out to a new file to recover documents that got corrupted or find hidden data in viruses that contain text by the virus writer or spies puting comments in images or other files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Full implementation of ISO 2022 files (ECMA-35) as a library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    PanDocElectron

    Graphical User Interface for PanDoc for Linux, Mac & Windows

    PanDoc Graphical User Interface implemented with Electron for Linux, Mac and Windows. It support users in converting source documents into various other formats like docx, odt, html and reveal documentation. The zip files contain the full source code because PanDocElectron is written in HTML/Javascript. Electron is used more or less as browser that runs the HTML/Javascript application. [Download PanDocElectron](https://sourceforge.net/p/pandocelectron/wiki/Home/) Extract the zip-file from Downloads in your Documents folders (directory Documents/PanDoc). It contains support files PanDocElectron. The Application PanDocElectron is available on - (Homepage) http://github.com/niebert/PanDocElectron - (Wiki) https://sourceforge.net/p/pandocelectron/wiki/Home/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Application working for any personal issues - easy financing, personal web page creating and handlig, photos archiver and viewer, bookmarking... Everything at one place as .NET application (+ web interface)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This project utilizes the iPod's ability to store and display short text files to allow you to view RSS Feeds, Weather Forecasts, Movie Showtimes, and other text documents on your iPod when you are away from your computer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    QReader - The High Quality Book Reader for Smart Mobiles. Symbian OS - different phones, different supported file types, designed to be most useful book reader for PDA/cellular.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Replasol is basically Visual Fart. The purpose is to create a simple front end GUI for FART (Find and Replace Text) in JAVA that allows you to use all of the command line options quickly and easily, as well as store .wind files, containing multiple Farts
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A Simple Tool Kit for Programmers and general users. The purpose of this project is to benefit both the programmers and the end users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A graphical MS Windows version of the ever useful "tail" command in *nix. Features RegEx highlighting, multiple notification methods (Flash, Beep, Email, Balloon), alternating line colors for readability, Threshold Seperators, and simple XML Config
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    XML Truncator-Fixer

    XML Truncator-Fixer

    Truncates XML at first error-uses the xmllint to add correct end tags.

    XML is intolerant of errors. This program will automatically salvage XML files by finding the first XML error, then truncating a configurable number of characters before the error and finally using the great xmllint to recover the xml file by automatically adding the correct end tags. The program starts with a default of 50 characters truncation. This truncation value may need to be changed if the truncation happens in the middle of a complex tag as xmllint may not know how to recover the file correctly. Additionally some extra text or data may be recoverable by changing the truncation amount to less than 50 characters.The extra truncation is needed in the first place because the XML validator used is not able to perfectly identify where the XML corruption begins and will often not flag the corruption for several characters after it begins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    csvtoxml will convert parse csv comma separated value data into xml. a command line console utility that uses stdin and stdout pipe with more cat, pr, wget, zip, find -exec for added functionality. file stream term c c++ small fast parser unix win osx
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.