We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue by monstrfolk Sun Dec 13 01:34:57 2020 Originally opened as codelucas/newspaper#862
Perhaps misunderstand the relationship from clean_top_node to clean_doc or doc, but cannot transverse from clean_top_node to clean_doc or doc.
For example, following will not work.
a = Article('https://somesite.com/some_article') a.download() a.parse() print(a.clean_doc.getroottree().getpath(a.clean_top_node))
Expect to be able to print the path from clean_doc/doc to clean_top_node.
The text was updated successfully, but these errors were encountered:
Comment by monstrfolk Sun Dec 13 01:36:21 2020
Please see codelucas/newspaper#863 with a fix for this issue.
Sorry, something went wrong.
ensured that cleaned_doc and cleaned_top_node are on the same DOM also, doc and top_node are on another DOM together.
Added a Article.text_clean property that returns the cleaned text of an article based on the clean_top_node.
No branches or pull requests
Issue by monstrfolk
Sun Dec 13 01:34:57 2020
Originally opened as codelucas/newspaper#862
Perhaps misunderstand the relationship from clean_top_node to clean_doc or doc, but cannot transverse from clean_top_node to clean_doc or doc.
For example, following will not work.
a = Article('https://somesite.com/some_article')
a.download()
a.parse()
print(a.clean_doc.getroottree().getpath(a.clean_top_node))
Expect to be able to print the path from clean_doc/doc to clean_top_node.
The text was updated successfully, but these errors were encountered: