Showing 77 open source projects for "dom parser"

  • 1
    html-react-parser

    HTML to React parser

    HTML to React parser that works on both the server (Node.js) and the client (browser). The parser converts an HTML string to one or more React elements. Available as part of the Tidelift Subscription. For TypeScript projects, you may need to check that domNode is an instance of domhandler's Element. Make sure to render parsed adjacent elements under a parent element.
    Downloads: 1 This Week
  • 2
    htmlparser2

    The fast & forgiving HTML and XML parser

    The fast & forgiving HTML and XML parser. htmlparser2 is the fastest HTML parser, and takes some shortcuts to get there. If you need strict HTML spec compliance, have a look at parse5. htmlparser2 itself provides a callback interface that allows the consumption of documents with minimal allocations. While the Parser interface closely resembles Node.js streams, it’s not a 100% match. Use the WritableStream interface to process a streaming input.
    Downloads: 0 This Week
  • 3
    hyperx

    Tagged template string virtual dom builder

    Tagged template string virtual DOM builder. This module is similar to JSX, but provided as a standards-compliant ES6 tagged template string function. hyperx works with virtual-dom, react, hyperscript, or any DOM builder with a hyperscript-style API: h(tagName, attrs, children). You might also want to check out the hyperxify browserify transform, which statically compiles hyperx into JavaScript expressions so that the hyperx parser does not have to be sent down the wire. Template strings are available in: node 4...
    Downloads: 0 This Week
  • 4
    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make every...
    Downloads: 2 This Week
  • 5
    PHP7

    PHP7 / Laravel Multi-format Streaming Parser

    When parsing XML/CSV/JSON/... documents, there are two approaches to consider. DOM loading reads the whole document into memory, which makes it easy to navigate and parse and gives developers maximum flexibility. Streaming iterates through the document like a cursor, stopping at each element on its way, and so avoids excessive memory use. With big files, callbacks are therefore executed while the file is still downloading, which is much more efficient as far as memory...
    Downloads: 1 This Week
  • 6
    Cheerio

    Implementation of core jQuery designed for the server

    Fast, flexible & lean implementation of core jQuery designed specifically for the server. Cheerio implements a subset of core jQuery. Cheerio removes all the DOM inconsistencies and browser cruft from the jQuery library, revealing its truly gorgeous API. Cheerio works with a very simple, consistent DOM model. As a result, parsing, manipulating, and rendering are incredibly efficient. Cheerio wraps the parse5 parser and can optionally use @FB55's forgiving htmlparser2. Cheerio can parse nearly...
    Downloads: 1 This Week
  • 7
    AngleSharp

    The ultimate angle brackets parser library parsing HTML5, MathML, SVG

    AngleSharp follows the W3C specifications and gives you the same results as state-of-the-art browsers. Besides the official API, AngleSharp adds some useful extension methods on top, which makes working with the DOM convenient. AngleSharp integrates everything you need to explore and mutate the DOM tree. Node retrieval is straightforward using powerful CSS query selectors. The CSS queries in AngleSharp are super fast and very simple to use. AngleSharp respects the relationship of HTML...
    Downloads: 0 This Week
  • 8
    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. "Native...
    Downloads: 0 This Week
  • 9
    goquery

    A little like that j-thing, only in Go

    goquery brings a syntax and a set of features similar to jQuery to the Go language. It is based on Go's net/html package and the CSS selector library Cascadia. Since the net/html parser returns nodes, and not a full-featured DOM tree, jQuery's stateful manipulation functions (like height(), css(), and detach()) have been left off. Also, because the net/html parser requires UTF-8 encoding, so does goquery: it is the caller's responsibility to ensure that the source document provides UTF-8...
    Downloads: 0 This Week
  • 10
    xml2js

    XML to JavaScript object converter

    xml2js is a Node.js module that converts XML into JavaScript objects (and vice versa). It simplifies XML parsing by using pure JavaScript and supports both synchronous and asynchronous parsing. It’s a lightweight alternative to heavy DOM parsers. To parse multiple files, there are three options: create one xml2js.Parser per file, which is the recommended option and is promised to always just work; call reset() on your parser object; or hope everything goes well anyway. That last behaviour is not guaranteed to work always, if ever.
    Downloads: 3 This Week
  • 11
    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    Pdf-extractor is a wrapper around pdf.js to generate images, svgs, html files, text files and json files from a pdf on node.js. A DOM Canvas is used to render and export the graphical layer of the pdf. Canvas exports *.png as a default but can be extended to export to other file types like .jpg. Pdf objects are converted to svg using the SVGGraphics parser of pdf.js. Pdf text is converted to HTML. This can be used as a (transparent) layer over the image to enable text selection. Pdf text...
    Downloads: 0 This Week
  • 12

    PHP Simple HTML DOM Parser

    A PHP-based DOM parser.

    A simple HTML DOM parser written in PHP5+ that supports invalid HTML and provides a very easy way to find, extract and modify the HTML elements of the DOM. Its jQuery-like syntax allows sophisticated finding methods for locating the elements you care about.
    Downloads: 1,669 This Week
  • 13

    sxmlc

    Simple, lightweight XML parser in C, statically or dynamically linked.

    Simple XML parser written in C, with limited XPath and Unicode support and customizable tags. 'Simple' means that it does not implement all XML specifications, only the most widely used ones.
    Downloads: 0 This Week
  • 14
    PSD.js

    A Photoshop PSD file parser for NodeJS and browsers

    psd.js is an open-source JavaScript library (CoffeeScript/JS) for reading and parsing Adobe Photoshop PSD files in both Node.js and web browsers. It reconstructs the document into a DOM-like tree with layers, masks, text metadata, vector information, and pixel data. It enables developers to traverse layer structures, extract flattened or individual image buffers, and integrate PSD assets programmatically in web or backend applications.
    Downloads: 3 This Week
  • 15
    JSSoup

    JavaScript + BeautifulSoup = JSSoup

    I'm a fan of the Python library BeautifulSoup. It's feature-rich and very easy to use. But when I was working on a small react-native project and tried to find an HTML parser library like BeautifulSoup, I failed. So I wanted to write an HTML parser library in JavaScript that is as easy to use as BeautifulSoup. JSSoup uses tautologistics/node-htmlparser as its HTML DOM parser and builds a BeautifulSoup-like API on top of it. JSSoup supports both node and react-native. JSSoup tries...
    Downloads: 0 This Week
  • 16
    Jodd

    Jodd! Lightweight, Java, zero dependencies, use what you like

    Jodd is a set of micro-frameworks and developer-friendly tools and utilities. Code-things-done. Great performances. Tiny dependencies. Use what you like. Welcome to the Jodd, a unique, alternative world of Java. You can find here some micro-frameworks and some handy tools. Nothing more, nothing less. The decision which component is promoted is based on the Maven Central Repo statistics. Version v5 contains remaining libraries and frameworks. They are in maintenance mode.
    Downloads: 0 This Week
  • 17
    jul-javascript

    JUL - The JavaScript UI Language

    The JavaScript UI Language (JUL) is a configuration and instantiation module for JavaScript frameworks. It can be used with any framework that accepts a configuration object as a constructor parameter, or with a custom factory for other frameworks. JUL parses a tree of configuration objects and creates the runtime components in the expected order and membership. For most uses, after this step, you will have your application's user interface up and running. Browser...
    Downloads: 1 This Week
  • 18
    undom

    1kb minimally viable DOM Document implementation

    Minimally viable DOM Document implementation. A bare-bones HTML DOM in a box. If you want the DOM but not a parser, this might be for you. JSFiddle Demo: Rendering preact components into an undom Document. Undom aims to find a sweet spot between size/performance and utility. The goal is to provide the simplest possible implementation of a DOM Document, such that libraries relying on the DOM can run in places where there isn't one available. The intent to keep things as simple as possible means...
    Downloads: 0 This Week
  • 19
    VTD-XML is the next generation XML parser/indexer/editor/slicer/assembler/xpath-engine that goes beyond DOM, SAX and PULL in performance, memory usage, and ease of use.
    Downloads: 8 This Week
  • 20
    RapidJSON

    A fast JSON parser/generator for C++ with both SAX/DOM style API

    RapidJSON is a JSON parser and generator for C++. It was inspired by RapidXml. RapidJSON is small but complete. It supports both SAX and DOM style APIs. The SAX parser is only about half a thousand lines of code. RapidJSON is fast. Its performance can be comparable to strlen(). It also optionally supports SSE2/SSE4.2 for acceleration. RapidJSON is self-contained and header-only. It does not depend on external libraries such as Boost. It does not even depend on the STL. RapidJSON is memory-friendly. Each...
    Downloads: 1 This Week
  • 21

    OXml

    The next generation XML library for Pascal (Delphi, FPC, Lazarus)

    ... and compilers (also for Delphi 4-2007).
    - The XML parser is a standalone unit, making it possible to write different and user-defined XML interfaces.
    - Extremely fast thanks to clever buffering.
    Parsers included in OXml:
    - Basic standalone XML reader and writer.
    - DOM (W3C DOM Level 1.0 Specification).
    - Sequential DOM parser
    - SAX parser
    Downloads: 0 This Week
  • 22

    PHP Advanced HTML DOM Parser

    A drop-in replacement for simple html dom

    This project has moved here https://github.com/monkeysuffrage/advanced_html_dom
    Downloads: 0 This Week
  • 23
    Roomba

    A Node.js tool to examine the correctness of Open Data Metadata

    Linked Open Data (LOD) has emerged as one of the largest collections of interlinked datasets on the web. Benefiting from this mine of data requires descriptive information about each dataset in the accompanying metadata. Such meta-information is currently limited to a few data portals, where it is usually provided manually and therefore gives few or poor-quality insights. To address this issue, we propose a scalable automatic approach for extracting, validating and generating...
    Downloads: 0 This Week
  • 24
    Products of the project: Java HTMLParser - VietSpider Web Data Extractor - Extractor VietSpider News. Click on "Show project details" to see more features of each product.
    Downloads: 0 This Week
  • 25

    HTML XHTML Parser + XPath

    Delphi HTML XHTML Parser +XPath

    Delphi HTML Parser. This module lets you work with HTML documents as a DOM tree and use XPath to search for tags. It is a very simple way to parse HTML. Tested with Delphi XE5 and XE6.
    Usage: add parser.pas to Uses;
    begin
      HtmlTxt:= ''; //here your html
      NodeList:= TNodeList.Create;
      ValueList:= TStringList.Create;
      DomTree:= TDomTree.Create;
      DomTreeNode:= DomTree.RootNode;
      If DomTreeNode.RunParse(HtmlTxt) then
      begin
        {your code example
    Downloads: 0 This Week