-
Notifications
You must be signed in to change notification settings - Fork 86
Closed
Labels
OutreachyAvailable for Outreachy participantsAvailable for Outreachy participantsfeatureNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is needed
Description
Terms
- I have searched open and closed feature requests
- I agree to follow Scribe-Data's Code of Conduct
Description
This issue will be the first issue to add dump processing functionality to the Scribe-Data CLI. In it, we'll do the following:
- We'll add in the
--wikidata-dump (-wd)argument to the total command - If the user passes this argument, the Add
check_lexeme_dump_prompt_downloadfunction to cli/utils.py #518 functionality will be passed to make sure that a dump is available or download one - From there, the functionality of the total command will be ran over the dump rather than the via the Wikidata query service
- This functionality will be added into a file
src/scribe_data/wikidata/parse_dump.pyand called from the CLI
- This functionality will be added into a file
Before starting, we should map out the best way to process the dump, with a specific question being whether we need to uncompress the dump or whether we can work directly from the compressed .json.bz2 file.
Contribution
@axif0 will be working on this as a part of Outreachy! 📶🚤
axif0
Metadata
Metadata
Assignees
Labels
OutreachyAvailable for Outreachy participantsAvailable for Outreachy participantsfeatureNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is needed
Type
Projects
Status
Done