Open Source Linux Text Processing Software

Text Processing Software for Linux

Browse free open source Text Processing software and projects for Linux below.

  • 1
    XMLStarlet is a set of command-line utilities for transforming, querying, validating, and editing XML documents and files using a simple set of shell commands, much as is done for text files with the UNIX grep, sed, awk, diff, patch, join, etc. utilities.
    Downloads: 1,215 This Week
  • 2
    Bowtie is an ultrafast, memory-efficient aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Downloads: 771 This Week
  • 3
    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file, find filenames fast using index files, rename many files using expressions, copy part of a file, change times of a file, set file time from filename, print colored text to terminal, convert csv to tab separated, download files from web, send http or udp requests, print tcp or udp traffic, create hexdump of files, join many text files into one, list nested .zip .tar .tar.gz .tar.bz2 archive contents. Fully portable, for USB stick, without installation. For Windows, Linux and Mac OS/X.
    Downloads: 630 This Week
  • 4
    AutoText

    Text expander (auto-type) application

    AutoText is a text expander (auto-typer) application that reduces the time you spend typing phrases you use often. You can bind an abbreviation (autotext) to a text snippet, phrase, or sentence, and AutoText will type that text automatically each time you type the abbreviation. Key press, hold, and toggle actions are supported through macros. AutoText works in any Windows program.
    Downloads: 74 This Week
  • 5
    FAR - Find And Replace
    Search and replace operations on file content across multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup, and various character encodings. Run grep-like extractions to condense or rearrange sources, or perform bulk file renaming.
    Downloads: 42 This Week
  • 6
    Find And Replace Text command-line utility. New and improved version of the well-known grep command, with advanced features such as case adaptation of the replace string, find (and replace) in filenames, and auto CVS edit. Moved to https://github.com/lionello/fart-it
    Downloads: 48 This Week
  • 7
    PTools is a set of useful tools written in Pascal. It includes a scientific calculator, archiver, text editor, remote administration, and more. It is designed to be portable across operating systems, especially Java-based mobiles, Windows, and Unixes.
    Downloads: 5 This Week
  • 8
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Downloads: 9 This Week
  • 9
    ansifilter

    ANSI sequence filter

    Ansifilter handles text files containing ANSI terminal escape codes. The command sequences may be stripped or be interpreted to generate formatted output (HTML, RTF, TeX, LaTeX, BBCode and Pango Markup).
    Downloads: 4 This Week
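The stripping mode described for ansifilter can be illustrated with a minimal Python sketch. This is not ansifilter's own implementation: the function name `strip_ansi` is hypothetical, and the pattern covers only CSI sequences (colors, cursor movement), not other escape types such as OSC.

```python
import re

# Matches CSI sequences only: ESC [ + parameter bytes + intermediate bytes + final byte.
# Other escape types (for example OSC titles) are deliberately out of scope here.
ANSI_CSI = re.compile(r"\x1b\[[0-?]*[ -/]*[@-~]")


def strip_ansi(text: str) -> str:
    """Return text with ANSI CSI escape sequences removed (hypothetical helper)."""
    return ANSI_CSI.sub("", text)


colored = "\x1b[1;31mERROR\x1b[0m: disk full"
print(strip_ansi(colored))  # prints "ERROR: disk full"
```

Interpreting the sequences to produce formatted output (HTML, RTF, and so on) would require tracking the active style state rather than discarding the codes.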
  • 10
    TEA is a text editor that provides a wide range of text-processing functions (over 100) and syntax highlighting. There are two branches of TEA: Qt-based and GTK-based.
    Downloads: 7 This Week
  • 11
    IMPORTANT NOTE: This project has moved to GitHub (https://github.com/pkozelka/libxml2-pas). Pascal units for accessing the popular XML API from Daniel Veillard (http://www.xmlsoft.org). They should be usable at least from Kylix and Delphi, and hopefully also from other Pascal compilers (such as Free Pascal).
    Downloads: 1 This Week
  • 12
    csart (Clever-Search-And-Replace-Text): search, find, and replace text in named files, or recursively in all directories (-r). If chosen, strings are replaced only if another key string occurs in the line (-w).
    Downloads: 3 This Week
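The conditional replacement that csart describes (replace a string only on lines that also contain a key string) could look roughly like the Python sketch below. It is an illustration of the idea only, not csart's code; the function name `replace_where` and its parameters are hypothetical, and recursive directory handling is left out.

```python
from typing import Optional


def replace_where(text: str, old: str, new: str, key: Optional[str] = None) -> str:
    """Replace old with new, but only on lines that contain key.

    When key is None, every line is eligible, like a plain search and replace.
    """
    out = []
    for line in text.splitlines(keepends=True):  # keepends preserves original newlines
        if key is None or key in line:
            line = line.replace(old, new)
        out.append(line)
    return "".join(out)


sample = "host=alpha port=80\nhost=beta port=80\n"
print(replace_where(sample, "port=80", "port=8080", key="beta"), end="")
```

Here only the second line is changed, because it is the only one containing the key string "beta".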
  • 13
    Transliterator between files in any language: map fonts, create encoding schemes, and input phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish, or any world language keyboard. Ex: [Phonetic Input]-[Any World Language Output] or vice versa.
    Downloads: 2 This Week
  • 14
    A UNIX shell script wrapper for the VI editor. It is recommended that VIB be used as an alias for the VI editor. VIB backs up any file edited using VI to a specified location while maintaining X backup revisions, and keeps a revision history of who made changes and when in a master log. VIB is ideal in environments where multiple users log into a UNIX system and you wish to keep track of who is modifying files. For example, VIB could be aliased in all new user shell accounts, so that when they edit a file using "vi", a log entry is generated and a backup copy of the file is saved and annotated with their username. The goal of VIB is to assist with tracking who modified what file, and when, on a UNIX/Linux system.
    Downloads: 2 This Week
  • 15
    A tool to transform clipboard content, primarily focused on alphabetical transformation (e.g. Cyrillic to Latin and similar). It also offers other functions for manipulating the clipboard content, such as clearing any formatting the content might have.
    Downloads: 1 This Week
  • 16
    Drag and drop files, directories, or HTML URLs into a Java GUI. Perform text operations on the files and write the results to output files. Operations include concatenation, text and regex editing, and other file/string/row/column/script operations.
    Downloads: 1 This Week
  • 17
    XWinGreek is a small keyboard utility for inputting classical (polytonic) Greek on Unix/Linux systems running the X Window System. It supports several encodings and uses a dead-key input method, as in the "Polytonic Greek" keyboard layout from the Windows 2000/XP distribution.
    Downloads: 1 This Week
  • 18
    GitSync is a shell script designed to simplify the use of the version control system Git (see www.git-scm.com for more information) by providing a "do everything to sync my repository" command.
    Downloads: 1 This Week
  • 19
    Visual xsltproc is a tool that helps you write an XSLT file and debug it to find errors. It writes XML and generates XML (with XML syntax highlighting and line numbers). Finally, if the result is XSL-FO, it generates the PDF with Apache FOP (Java). Built on Qt 4.2.
    Downloads: 1 This Week
  • 20
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
  • 21
    A utility to read/write from: pipes, sockets, standard handles, files, block devices, NT Native API objects, file mapping/section objects, comm ports, etc. Supports various text/data conversions, multiple inputs/outputs, and various IO types/options.
    Downloads: 1 This Week
  • 22
    Boarded - on-screen keyboard for X11
    Boarded is a virtual on-screen keyboard for the X11 window system. It is intended to be used on mobile devices without a hardware keyboard. It allows you to design your own keyboard layouts, specifying key positioning and grouping and setting your own colors.
    Downloads: 0 This Week
  • 23
    A backup/transfer program for moving and processing files between computers in a production (industrial) process. Processing of a file begins when it is created in or copied to a directory, and files can be redirected to another machine for use.
    Downloads: 0 This Week
  • 24
    CTA (conversor de ficheros de texto, "text file converter") is a program to change the format of one or more text files between the Unix text file format and the DOS/Windows text file format. With this easy program you can see text files correctly, without annoying symbols or comp
    Downloads: 0 This Week
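The Unix/DOS conversion that CTA performs comes down to translating line endings between LF and CRLF. A minimal Python sketch of that idea follows; it is not CTA's implementation, and the function names `to_unix` and `to_dos` are hypothetical. It works on bytes so that the text encoding is left untouched.

```python
def to_unix(data: bytes) -> bytes:
    """Convert DOS/Windows line endings (CRLF) to Unix line endings (LF)."""
    return data.replace(b"\r\n", b"\n")


def to_dos(data: bytes) -> bytes:
    """Convert Unix line endings (LF) to DOS/Windows line endings (CRLF)."""
    # Normalize to LF first so existing CRLF pairs are not turned into CR CR LF.
    return to_unix(data).replace(b"\n", b"\r\n")


mixed = b"first\r\nsecond\nthird\r\n"
print(to_unix(mixed))  # b'first\nsecond\nthird\n'
print(to_dos(mixed))   # b'first\r\nsecond\r\nthird\r\n'
```

Normalizing before converting to CRLF keeps the operation safe to run on files that already contain a mix of both conventions.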
  • 25
    The FileNamer for EML allows you to rename a lot of .eml files like 001.eml, 002.eml, 003.eml to something more descriptive.
    Downloads: 0 This Week