trafilatura

v1.9.0

A Python package and command-line tool for gathering text on the Web. It includes all the discovery and text-processing components needed for web crawling, downloading, scraping, and extraction of main text, metadata, and comments. For more information about how to use this package, see the README.

Latest version published 21 days ago
License: Apache-2.0

Package Health Score

88 / 100

Popularity

Influential project
GitHub Stars
2.93K
Forks
223
Contributors
40

Direct Usage Popularity

TOP 30%

Based on project statistics from the GitHub repository for the PyPI package trafilatura, we found that it has been starred 2,932 times.

Security

No known security issues
1.9.0 (Latest)

Security and license risk for latest version

Release Date
May 2, 2024
Direct Vulnerabilities
0 Critical, 0 High, 0 Medium, 0 Low
Indirect Vulnerabilities
0 Critical, 0 High, 0 Medium, 0 Low
License Risk
0 High, 1 Medium, 0 Low
All security vulnerabilities belong to production dependencies of direct and indirect packages.

License
Apache-2.0

Security Policy
No

We found a way for you to contribute to the project! Looks like trafilatura is missing a security policy.



Maintenance

Healthy

Open Issues
63
Open PR
5
Last Release
21 days ago
Last Commit
8 days ago

Further analysis of the maintenance status of trafilatura, based on the cadence of released PyPI versions, repository activity, and other data points, determined that its maintenance is Healthy.

We found that trafilatura demonstrates a positive version release cadence with at least one new version released in the past 3 months.

As a healthy sign of ongoing project maintenance, we found that the community interacted with at least 1 pull request or issue in the GitHub repository.

Community

Active
Readme
Yes
Contributing.md
Yes
Code of Conduct
No
Contributors
40
Funding
Yes

With more than 10 contributors to the trafilatura repository, this is possibly a sign of a growing and inviting community.

We found a way for you to contribute to the project! Looks like trafilatura is missing a Code of Conduct.


Package

Python Versions Compatibility
>=3.6

Age
5 years
Latest Release
21 days ago
Dependencies
1 Direct / 19 Total
Versions
44
Maintainers
1
Wheels
OS Independent
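
The compatibility field above lists Python >=3.6. A minimal sketch of how a script might check the running interpreter against that bound before installing or importing the package is shown below. The helper name `is_supported` and the constant `MIN_PYTHON` are hypothetical and used only for illustration; they are not part of trafilatura.

```python
import sys

# Minimum interpreter version, taken from the ">=3.6" compatibility field.
# MIN_PYTHON and is_supported are illustrative names, not trafilatura APIs.
MIN_PYTHON = (3, 6)


def is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if is_supported():
        print("This interpreter meets the listed >=3.6 requirement.")
    else:
        print("This interpreter is older than Python 3.6.")
```

Only the major and minor components are compared, which matches how a `>=3.6` bound is normally read; patch releases do not affect the result.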