Skip to content

.text may guess the encoding incorrectly #163

@097115

Description

@097115

Steps to reproduce:

import requests
from readability import Document
response = requests.get('https://polit.ru/article/2021/09/14/ps_dennet/')
print(Document(response.text).summary())

However, if we use .content:

    print(Document(response.content).summary())

everything will be just fine.

May be updating README.rst is worth a shot :)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions