10 releases

0.1.9 Mar 12, 2021
0.1.8 Jun 1, 2020
0.1.7 Apr 18, 2020

#15 in #scraping

Download history 260/week @ 2025-04-09 230/week @ 2025-04-16 400/week @ 2025-04-23 341/week @ 2025-04-30 302/week @ 2025-05-07 189/week @ 2025-05-14 200/week @ 2025-05-21 141/week @ 2025-05-28 202/week @ 2025-06-04 220/week @ 2025-06-11 366/week @ 2025-06-18 447/week @ 2025-06-25 425/week @ 2025-07-02 470/week @ 2025-07-09 816/week @ 2025-07-16 942/week @ 2025-07-23

2,708 downloads per month
Used in 16 crates (13 directly)

MIT/Apache

225KB
2K SLoC

Nipper

A crate for manipulating HTML with Rust.

Nipper based on HTML crate html5ever and the CSS selector crate selectors. You can use the jQuery-like syntax to query and manipulate an HTML document quickly. Not only can query, but also can modify.

nipper-logo

Example

Extract the hacker news.

use nipper::Document;

fn main() {
    let html = include_str!("../test-pages/hacker_news.html");
    let document = Document::from(html);

    document.select("tr.athing").iter().for_each(|athing| {
        let title = athing.select(".title a");
        let href = athing.select(".storylink");
        println!("{}", title.text());
        println!("{}", href.attr("href").unwrap());
        println!();
    });
}

Readability.

examples/readability.rs

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Dependencies

~3–9MB
~76K SLoC