Using Pywsd in other languages (french, or others) #29

Open
GMarzinotto opened this issue Feb 7, 2017 · 3 comments
Comments

@GMarzinotto

Good afternoon,

I was wondering whether it would be possible to adapt this tool to other languages, such as French or Spanish.
If it is feasible, could you give me some pointers on how to make these modifications?

Thank you very much!

@alvations
Owner

@GMarzinotto it's a good suggestion, but the bulk of pywsd still relies on variants of the Lesk algorithm. To extend the code to other languages, we first have to get translations of the glosses (i.e. definitions) for every Synset.
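
To illustrate why the glosses matter, here is a minimal sketch of a simplified Lesk disambiguator. It is not pywsd's implementation: the sense IDs, the French glosses, and the function names are all made up for the example, and tokenization is plain whitespace splitting. The point is only that the overlap is computed between the context and the gloss, so both need to be in the same language.

```python
# Toy sense inventory. The sense IDs and French glosses are invented for
# illustration; they are not taken from WordNet, OMW, or pywsd.
TOY_GLOSSES_FR = {
    "avocat.n.01": "personne qui défend un client devant un tribunal",
    "avocat.n.02": "fruit vert à gros noyau que l'on mange en salade",
}


def tokenize(text):
    """Lowercase and split on whitespace (no lemmatization or stopword removal)."""
    return set(text.lower().split())


def simplified_lesk(context_sentence, glosses):
    """Return the sense ID whose gloss shares the most tokens with the context."""
    context = tokenize(context_sentence)
    best_sense, best_overlap = None, -1
    for sense, gloss in glosses.items():
        overlap = len(context & tokenize(gloss))
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense


print(simplified_lesk("je mange un avocat en salade", TOY_GLOSSES_FR))
```

With English-only glosses the overlap against a French sentence would be close to empty for every sense, which is why translated glosses are the first requirement.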

@geekan

geekan commented Feb 22, 2017

@alvations
Question: how do you generate the synsets and glosses?

@alvations
Owner

@geekan Do you mean "how to generate synsets and glosses" for other languages? If so, the first step is to take synsets from the Open Multilingual Wordnet (OMW), which maps synsets in other languages to the Princeton WordNet IDs, and then somehow translate the glosses from English into the other languages (manually or automatically).
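
A rough sketch of those two steps, under stated assumptions: the sample rows mimic a tab-separated `<synset id> <lang>:lemma <lemma>` layout and are illustrative rather than copied from an OMW release, the English glosses are paraphrased toy entries, and `translate` is a hypothetical callable standing in for a manual lookup table or a machine translation system.

```python
from collections import defaultdict

# Illustrative rows in an assumed tab-separated layout:
# <synset id> TAB <lang>:lemma TAB <lemma>
SAMPLE_OMW_FR = """\
# illustrative rows only
02084071-n\tfra:lemma\tchien
02121620-n\tfra:lemma\tchat
02121620-n\tfra:lemma\tminet
"""

# Toy English glosses keyed by the same synset IDs (not real WordNet text).
TOY_GLOSSES_EN = {
    "02084071-n": "a domesticated canine",
    "02121620-n": "a small domesticated feline",
}


def load_lemma_index(tab_text, lang):
    """Map each lemma of `lang` to the set of synset IDs that list it."""
    index = defaultdict(set)
    for line in tab_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment rows
        synset_id, relation, lemma = line.split("\t")
        if relation == f"{lang}:lemma":
            index[lemma].add(synset_id)
    return index


def build_glosses(english_glosses, translate):
    """Apply the supplied `translate` callable to every English gloss."""
    return {sid: translate(gloss) for sid, gloss in english_glosses.items()}


# A hand-written table plays the role of the translator in this sketch.
MANUAL_FR = {
    "a domesticated canine": "un canidé domestiqué",
    "a small domesticated feline": "un petit félin domestiqué",
}

lemma_index = load_lemma_index(SAMPLE_OMW_FR, "fra")
glosses_fr = build_glosses(TOY_GLOSSES_EN, MANUAL_FR.get)
```

The resulting pair (lemma to synset IDs, synset ID to target-language gloss) is the kind of sense inventory a gloss-overlap method needs; the quality of the disambiguation would then depend heavily on how good the translated glosses are.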

Just a straw poll: is scaling to other languages more important than implementing more state-of-the-art algorithms for English? If so, BabelNet would be an option, but it's non-commercial =(
