Browse free open source XML software and projects below. Use the toggles on the left to filter open source XML software by OS, license, language, programming language, and project status.

  • 1
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text-processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 47 This Week
  • 2
    PHP-XML is a class written in PHP to create, edit, modify, and read XML documents.
    Downloads: 5 This Week
  • 3
    MOVED: http://code.google.com/p/rssfilter/ Selects and/or rejects articles from RSS feeds based on keywords you supply. Runs in your taskbar! Originally intended to filter new content from scientific journals such as PRL or the arXiv (cond-mat, etc.).
    Downloads: 2 This Week
  • 4
    Digital Learning Sciences (DLS) is a mission-centered, not-for-profit organization dedicated to improving learning through the use of digital content and tools.
    Downloads: 1 This Week
  • 5
    Standard-Sitemap Project
    The Standard-Sitemap Navigator is a Firefox extension to help navigate websites. The webmaster of a site can provide a navigational sitemap in XML, and this extension fetches it and displays it as you visit.
    Downloads: 1 This Week
  • 6

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
  • 7
    People Connection Art Portal. To connect Art creators and Art consumers.
    Downloads: 0 This Week
  • 8
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
  • 9
    An HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi-structured data generated by the scraper component.
    Downloads: 0 This Week
  • 10
    GsgXml is a class used to build XML sitemaps for the Google Sitemaps service.
    Downloads: 0 This Week
  • 11
    Image2DocInfo was made to quickly tag digital pictures. A GUI allows you to set attributes for an image and then store them in XML files. Those files follow the Dublin Core naming scheme and are stored in the same directories as the pictures.
    Downloads: 0 This Week
  • 12

    Infofuze

    Data migration/conversion library based on STX and XSLT transformations

    Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
    Downloads: 0 This Week
  • 13
    The Java-Sitemapper is a Java API for building sitemap files to improve search indexing on Google, Yahoo!, MSN, and Ask.com. This project strives to implement the latest in search technology for use on the Java platform.
    Downloads: 0 This Week
  • 14
    The Jorne project develops software and open standards for linking Lojban text with WWW and Semantic Web metadata (e.g. RDF/N3, RSS, XML). Lojban is an artificial spoken and written language based on predicate logic.
    Downloads: 0 This Week
  • 15
    LUPUS (LUPUS Universal Plattformindependent Update System) is a powerful tool for system administrators. It is a kind of mixture of apt, dpkg, Subversion, cfengine and qmake ;-)
    Downloads: 0 This Week
  • 16
    Methanol is a scriptable multi-purpose web crawling system with an extensible configuration system and speed-optimized architectural design. Methabot is the web crawler of Methanol.
    Downloads: 0 This Week
  • 17
    OpenMKS is a search & navigational tool for large multimedia collections. With pluggable functionality and a core subsystem supporting the z39.50 ZING Community SRW search & retrieval specification, it can be run either as a Servlet or as a Web Service.
    Downloads: 0 This Week
  • 18
    POPsearch is a desktop search engine that's designed to help you find information on your computer. This information can then be accessed remotely with RSS feeds, email feeds, or from any computer that has a web browser.
    Downloads: 0 This Week
  • 19
    PSE (Personal Site Engine) is a small PHP/XML-based engine intended for building sites. It provides a simple and fast way to create a website from scratch.
    Downloads: 0 This Week
  • 20
    Generates RDF and RDFS ontology documents automatically from HTML pages once given a set of rules.
    Downloads: 0 This Week
  • 21
    The Retrieval Component Integrator Project (RECOIN) intends to provide an extensible framework of Java classes to build a meta-search and information retrieval (IR) system based on heterogeneous IR components as part of a modular retrieval process. The so
    Downloads: 0 This Week
  • 22
    This is a collection of REST specifications, and implementations of those specs, for very low-level information sharing and workflow operations using REST actions over HTTP. Implementations are in various languages, mainly Java, Python, and Ruby.
    Downloads: 0 This Week
  • 23
    SAFOX - Simple API For XML Handling. A PHP project merging the DOM approach with XML/PHP. It provides a simple OOD API for PHP-based XML generation, parsing, manipulation, traversal, etc. Currently based on three classes, SAFOX is a true OOP approach to XML Wi
    Downloads: 0 This Week
  • 24
    A robust website scraping framework that uses XML, XPath, RegEx and scripting to consume, parse, normalize and traverse HTML based on a set of seed URLs. Scrape.NET is built using C#, TidyForNet (the p-invoke only version) and HTML Tidy.
    Downloads: 0 This Week
  • 25

    ScraperEdit for XBMC

    XML bindings and a GUI for creating and editing XBMC Scrapers

    This program is an editor for creating XBMC Scrapers. It is similar to ScraperEditor, another editor using ScraperXML, which runs under the .NET environment. This program runs under Sun/Oracle's Java Runtime. HELP WANTED! I am looking for someone to help me write documentation, such as a user's manual and online help. Also, if anyone wants to help, translated language files are always welcome...
    Downloads: 0 This Week