Open Source Text Processing Software - Page 8

Text Processing Software

  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 1
    Fen2eps is a small console program that converts FEN (Forsyth Edwards Notation) strings to EPS (Encapsulated Postscript) files containing the chess board diagram.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    FictionVocabulary is a tool for counting words in text files. It uses vocabularies with word sets e.g. 1000 most useful words etc. Counting occurs through vocabularies, so word list consists only of words which are not represent in vocabularies
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    FileExtender is a Perl script to evaluate embedded SQL statements in any kind of text file (incl. HTML files) and extends these files with results from the database queries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    The FileNamer for EML allows you to rename a lot of .eml files like 001.eml, 002.eml, 003.eml to something more descriptive.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Simple Java delimited and fixed width file parser. Handles CSV, Excel CSV, Tab, Pipe delimiters, just to name a few. Maps column positions in the file to user friendly names via XML. See "FlatPack Feature List" under News for complete feature list.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Java library for reading and writing of flat files. CSV, FLR (fixed length record) or mixed structures. Tree-style processing API. Adapters for SAX, Stax and XStream for transformation, data binding or serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Flywheel is a multi-purpose script engine. Define once, publish anywhere. Only seven template commands to learn. Full Java classpath access to functions through the Execute command. Iterate through lists with the List command.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    The most powerful non-commercial translation memory software (TM tool) with enhanced capabilities, like networking/collaboration (http, rpc), encoding conversion, project management capabilities, email capability with attachments, file tree diff etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser.
    Download Chrome
  • 10
    Fsgrep is a simple Java application which allows a user to search all files in a directory structure for lines matching a given pattern. Its functionality is a combination of the Unix find and grep utilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Eclipse WTP is the base project for many projects,includes PDT(PHP),MyEclipse. gmapwtp make Eclipse WTP support Google Map API Content Assistance.for examle you input gmap. and then editor will show you methods and properties of JavaScript Class GMap2.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GNU FriBidi is the Free Implementation of the Unicode Bidirectional Algorithm. GNU FriBidi development has been moved to GitHub. See https://github.com/fribidi/fribidi/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    This project has MOVED to http://savannah.gnu.org/projects/libiconv/ !!! This library provides an iconv() implementation, for use on systems systems which don't have one, or whose implementation cannot convert from/to Unicode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    GOSH takes an Usenet-styled ASCII text as input and generates Latex-code, HTML, or Man-pages. Its transformation is easily customizable, which makes GOSH suited for creating web content, papers, letters, and technical documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A Java application for statistical analysis and systematic manipulation of natural language texts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Webbased groupware written in php,mysql. Supports log,events,project management, calendar, todolist, contact management, cvs (file management). Next version includes a rss feed
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This project offers galician dictionaries for several spell checkers: "Ispell", "Myspell" , "Aspell", "Spell Checker for Edit Boxes" and "Excalibur".
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    GenDiapo is renamed GenDoc. GenDoc is a XML Editor written in Java2. It is based on a existing project (MerlotXML now called Xerlin), and can use two kinds of plugins (DTD and/or action) for editing and publishing
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Gorille is a small Java package designed to let developers of various kinds of XML processors test the content and names of XML structures in their XML documents. Gorille includes test files for XML 1.0 and the draft XML 1.1, or you can create your own.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Gothe is a writing aid in picking the most appropriate prepositions or synonyms in a text. It does this by checking the frequency of appearance of different combination on Google.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A multiplatform visual implementation of the Unix utility grep
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Guiguts is a Perl/Tk text editor designed for editing and formatting public domain material for inclusion at Project Gutenberg (www.gutenberg.org). Features are provided for editing text files produced by Distributed Proofreaders (www.pgdp.net). For help or to contact the developers, see http://www.pgdp.net/phpBB2/viewtopic.php?t=46944
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This script converts books from the open Gutenberg project base text format into a format easily readable by an iPod's notes feature. This project is written in Python and is so accessible across many Operating systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Downloadable and open source Chinese-Spanish vocabulary inspired by the CEDICT and EDICT dictionaries. It is distributed in a plain Unicode text file that can be easily ported to other formats or used by different applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Strip out useless tags and other junk from HTML files. Shrink files, enhance readability of HTML source, promote privacy, and clean HTML exported from Microsoft Word (MS-Word). Run HTMLStrip as-is or customize it with your own regular expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.