A modern ebook manager and reader with sync and backup capabilities for Windows, macOS, Linux, Android, iOS and Web

JavaScript 25,391 1,910 Updated Dec 21, 2025
Java 2,763 346 Updated Dec 16, 2025

A Python 3 library for generating Anki decks

Python 2,484 189 Updated Dec 30, 2024

Study notes

Python 636 232 Updated Jun 19, 2019

A GitHub App built with Probot that closes abandoned Issues and Pull Requests after a period of inactivity.

JavaScript 1,265 169 Updated May 20, 2023

Sina Weibo crawler: scrapes Sina Weibo data with Python

Python 7 1 Updated Sep 21, 2025

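An opinion-mining project based on online homestay UGC data, covering data mining and NLP processing, with tasks including data collection, topic extraction, and sentiment analysis. It aims to overcome inconsistencies between user ratings and reviews and to evaluate satisfaction with online homestays in real time, and includes online review collection and visual sentiment analysis. It provides a Baidu Maps POI query entry point that supports automated batch queries of POI information; builds an LDA automatic topic-clustering model on an online homestay corpus, using topic center words to find the corresponding topic attribute dictionary; using user ratings as…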

Python 436 127 Updated Oct 30, 2024

Best Practices on Recommendation Systems

Python 21,269 3,274 Updated Dec 16, 2025

TensorFlow documentation

Jupyter Notebook 6,281 5,366 Updated Dec 2, 2025

Hexo PWA plugin

JavaScript 135 15 Updated Jun 14, 2021

My Blog / Jekyll Themes / PWA

HTML 7,605 6,469 Updated Feb 10, 2025

A fast-loading Hexo theme that does not pull in third-party JS libraries

EJS 324 37 Updated Mar 5, 2021

An Open Source Machine Learning Framework for Everyone

C++ 192,885 75,169 Updated Dec 21, 2025

A Python scikit for building and analyzing recommender systems

Python 6,740 1,049 Updated Jul 24, 2025

News site crawler; currently able to crawl news pages from sites such as NetEase, Sina, QQ, and Sohu, and save them locally.

Python 34 25 Updated Jun 12, 2015

A dynamically configurable news crawler based on Scrapy

Python 165 72 Updated Jul 24, 2017

A Scrapy-based news crawler

Python 102 34 Updated Apr 18, 2020

An enterprise-grade distributed crawler development architecture template for Python Scrapy

Python 95 59 Updated Mar 1, 2018

BP (backpropagation) neural network classifier

Java 128 78 Updated Mar 22, 2016

A WebSocket server for Python

Python 83 67 Updated Nov 14, 2025

Jieba Chinese word segmentation

Python 34,656 6,733 Updated Aug 21, 2024

A Django blog project based on Python 3.5 and Django 1.10.

CSS 2,370 874 Updated Jun 10, 2021

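A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: a MongoDB cluster provides the underlying storage, Redis handles distribution, and Graphite displays crawler status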

Python 3,257 1,580 Updated Apr 18, 2017

Information on more than 80,000 Chinese Internet companies.

82 18 Updated Oct 14, 2016

aaa

1 Updated Oct 19, 2015