Alessandro Zanella
  • Home
  • About
  • Skills
  • Experiences
  • Github
  • LinkedIn
  • White Hat Wiki
    • Practical Guide To Pentesting
  • Project Ideas
    • Internal Link Crawler
  • TOM for Financial Freedom
  • Personal Pentesting Helper App
Hero Image
Internal Link Crawler/Scraper

A Python Scraper/Crawler App That Check a Websites Page Source Code for internal Links and visits those links to gather more links (With Compliancy & Safety measures) Compliance & Safety Using web crawlers like the Python script I provided is not inherently illegal, but it depends on how and where you use them. Here are the key legal and ethical considerations: 1. Check the Website’s robots.txt File Most websites have a robots.txt file (e.g., https://example.com/robots.txt) that specifies which pages can or cannot be crawled. While robots.txt isn’t legally binding in most cases, ignoring it may violate terms of service.

Monday, January 1, 1 Read
Navigation
  • About
  • Skills
  • Experiences
  • Github
  • LinkedIn
Contact me:
  • zan15592@gmail.com
  • LinkedIn
  • +40 791354045