
Secure scraping guide #1872

@janbuchar


We should mention at least:

  • running scrapers in isolation (separate containers, VMs, ...),
  • not running scrapers next to mission-critical software,
  • treating the websites being scraped as untrusted input,
  • browsers (via Playwright) execute untrusted code, which makes them essentially giant security holes.
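
To illustrate the "untrusted input" point, here is a minimal sketch of checking a link found on a scraped page before requesting it. It uses only the Python standard library; the function name `is_safe_url`, the `allowed_hosts` parameter, and the blocked-hostname set are hypothetical names chosen for this example and are not part of any crawler API. The guide itself might recommend a different or stricter policy.

```python
import ipaddress
from typing import Optional, Set
from urllib.parse import urlsplit

# Assumption for this sketch: only plain web schemes are ever followed.
ALLOWED_SCHEMES = {"http", "https"}
# Assumption: hostnames that should never be reached from a scraper.
BLOCKED_HOSTNAMES = {"localhost"}


def is_safe_url(url: str, allowed_hosts: Optional[Set[str]] = None) -> bool:
    """Return True if a scraped URL looks safe to request (hypothetical helper)."""
    try:
        parts = urlsplit(url)
        host = parts.hostname  # lowercased by urlsplit, None if missing
    except ValueError:
        # Malformed URLs (for example a broken IPv6 literal) are rejected.
        return False

    # Reject file:, javascript:, data: and any other non-web scheme.
    if parts.scheme not in ALLOWED_SCHEMES:
        return False
    if not host or host in BLOCKED_HOSTNAMES:
        return False

    # If the host is an IP literal, reject anything that is not globally
    # routable (loopback, private ranges, link-local metadata endpoints).
    try:
        address = ipaddress.ip_address(host)
    except ValueError:
        address = None  # not an IP literal, so it is a DNS name
    if address is not None and not address.is_global:
        return False

    # Optional allowlist: when given, only the listed hosts are accepted.
    if allowed_hosts is not None and host not in allowed_hosts:
        return False
    return True


if __name__ == "__main__":
    for candidate in (
        "https://example.com/page",
        "file:///etc/passwd",
        "http://169.254.169.254/latest/meta-data/",
    ):
        print(candidate, is_safe_url(candidate))
```

This check only looks at the URL text. A DNS name can still resolve to an internal address, so it complements the isolation measures listed above rather than replacing them.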

Labels: documentation, t-tooling
