
added netloc support for hdfs URIs #168

Open
vvaten wants to merge 1 commit into piskvorky:develop from vvaten:hdfs_netloc_support


@vvaten vvaten commented Jan 2, 2018


removed the special handling of HDFS URIs which was against the URI specification

fixes #162

…DFS URIs which was against the URI specification
mpenkov commented Jan 2, 2018

Collaborator

@vvaten Thank you for your pull request. It looks good to me.

@menshikh-iv Might be a good idea to merge this after our HDFS integration tests are up. What do you think?

@menshikh-iv

Contributor

@mpenkov I agree.
@vvaten thanks for the PR! Sorry for the wait, but first we need to finish #151.

* file:///home/user/file
* file:///home/user/file.bz2

NOTE: hdfs://path/file does no longer work as it is against the URI

Contributor


Can you add more information to the comment (when this happens, which HDFS versions are affected, etc.)?

@vvaten vvaten Feb 15, 2018

Author


This is a generic change that affects all HDFS versions. They all support the hdfs://host/path/file URI format. Using hdfs://path/file in smart_open violates this and also violates the URI specification (RFC 3986), where the host part always comes directly after the '://'. The correct way to refer to local content is hdfs:///path/file, not hdfs://path/file.
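
To illustrate the point about the host part, here is a minimal sketch using Python's standard `urllib.parse.urlsplit`. The helper name `split_hdfs_uri` is hypothetical and is not part of smart_open or of this PR; it only shows how a generic URI parser reads the three forms discussed above.

```python
from urllib.parse import urlsplit


def split_hdfs_uri(uri):
    """Hypothetical helper: return (netloc, path) for an hdfs:// URI."""
    parts = urlsplit(uri)
    if parts.scheme != "hdfs":
        raise ValueError("not an hdfs URI: %r" % uri)
    # Whatever sits between '://' and the next '/' is the network location.
    return parts.netloc, parts.path


# Host given explicitly: netloc is the host, the rest is the path.
print(split_hdfs_uri("hdfs://host/path/file"))  # ('host', '/path/file')

# Empty netloc (three slashes): the whole remainder is the path.
print(split_hdfs_uri("hdfs:///path/file"))      # ('', '/path/file')

# Two slashes without a host: 'path' is read as the host, not as a directory.
print(split_hdfs_uri("hdfs://path/file"))       # ('path', '/file')
```

The last call shows why hdfs://path/file is ambiguous: a parser that follows the specification treats "path" as the network location, so the first path component is lost.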

@mpenkov mpenkov added the stale No recent activity from author label Sep 28, 2019
@mpenkov mpenkov self-assigned this Sep 28, 2019
@mpenkov mpenkov changed the base branch from master to develop April 25, 2020 09:04
@govindmurthi21


Hello, is there any way this can be merged? As part of my work I was trying to read from and write to HDFS and traced the bug to this.
HDFS commands require a fully qualified URI, such as "hdfs dfs -ls hdfs://path".

@ddelange

Collaborator

Hi @vvaten 👋

I know it's been a while. It looks like this PR is still valid, only outdated. Do you want to pull develop and get this merged?

@ddelange ddelange mentioned this pull request Jun 8, 2026
17 tasks

Labels

enhancement stale No recent activity from author

Development

Successfully merging this pull request may close these issues.

HDFS URI support does not support network location

5 participants