How to use the goose3.Configuration function in goose3

To help you get started, we’ve selected a few goose3 examples, based on popular ways it is used in public projects.

Secure your code as it's written. Use Snyk Code to scan source code in minutes - no build needed - and fix issues immediately.

github fanmatics / metadoc / metadoc / extract / extractor.py View on Github external
self.title = title or None
    self.entities = []
    self.keywords = []
    self.names = []
    self.fulltext = None
    self.language = None
    self.description = None
    self.canonical_url = None
    self.image = None
    self.published_date = None
    self.modified_date = None
    self.scraped_date = None
    self.contenthash = None
    self.reading_time = None

    config = Configuration()
    config.enable_image_fetching = False
    self.goose = Goose(config=config)

    self.tree = None

goose3

Html Content / Article Extractor, web scrapping for Python3

Apache-2.0
Latest version published 3 months ago

Package Health Score

74 / 100
Full package analysis