Abstract

On an Internet saturated with big data, crawlers can greatly improve the efficiency of information search. This paper briefly introduces Python, Python‐based hacker attack techniques, and a Python‐based crawler program. A web hacker attack module was then embedded into the crawler program so that it could crawl deeper hidden information restricted by permission. A conventional crawler program, a C language crawler program, and the Python‐based crawler program were then tested. The results showed that the conventional crawler program failed to crawl the complete information data when facing a web page with permission restrictions; the C language crawler program could crawl the hidden information but had low efficiency; and the Python‐based crawler program obtained the complete information data after bypassing the permission restriction through Structured Query Language (SQL) injection performed by the web hacker attack module. In summary, Python allows crawler programs to be written in a relatively simple way and to embed a hacker attack program for crawling hidden information on the web.

