Due: Thursday Feb 15, 2006; 10:00PM
Web Dumper
Write a Python program that takes a web page URL and downloads the page and any linked images to the user's local computer. You may have used such a facility in a web browser like Internet Explorer -- the ability to save an entire HTML file including linked images.
I will leave much of the operation of this program up to your creativity. As a minimum, though, it must support the following features correctly:
The program should take a URL as a command-line argument. (You may provide other command-line options if you wish.)
The images should be downloaded into a directory in the same location as the saved HTML file. The organization of images in that directory (e.g. by using subdirectories) is up to you. The name of the top-level image directory should be derived from the name of the HTML file.
You must also update the image tags in the saved HTML file so that they reference the locally downloaded images.
Your program should recognize at least GIF and JPG image files.
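To make the minimum requirements above concrete, here is one possible sketch of how the pieces could fit together. It is only an illustration, not a required design. The function names (image_dir_for, localize_images, save_page, main), the "_files" directory suffix, the numbered file names, and the script name webdump.py are all assumptions made for this example. It uses a regular expression that only handles quoted src attributes; a real solution may want a proper HTML parser.

```python
import os
import posixpath
import re
import sys
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen, urlretrieve

# Extensions treated as images (the assignment requires at least GIF and JPG).
IMAGE_EXTENSIONS = {".gif", ".jpg", ".jpeg"}

# Matches the quoted src attribute of an <img> tag; unquoted values are ignored.
IMG_SRC = re.compile(r'(<img\b[^>]*?\ssrc\s*=\s*)(["\'])(.*?)\2',
                     re.IGNORECASE | re.DOTALL)


def image_dir_for(html_path):
    """Derive the image directory name from the HTML file name.

    Assumed convention for this sketch: page.html -> page_files
    """
    stem, _ext = os.path.splitext(os.path.basename(html_path))
    return stem + "_files"


def localize_images(html, base_url, image_dir):
    """Return (rewritten_html, downloads).

    downloads maps each absolute image URL to its relative local path.
    """
    downloads = {}

    def replace(match):
        prefix, quote, src = match.groups()
        absolute = urljoin(base_url, src)  # resolve relative links
        name = posixpath.basename(urlparse(absolute).path)
        ext = posixpath.splitext(name)[1].lower()
        if ext not in IMAGE_EXTENSIONS:
            return match.group(0)  # leave unrecognized types untouched
        if absolute not in downloads:
            # A counter prefix keeps same-named files from different
            # URLs from overwriting each other.
            downloads[absolute] = "%s/%d_%s" % (image_dir, len(downloads), name)
        return prefix + quote + downloads[absolute] + quote

    return IMG_SRC.sub(replace, html), downloads


def save_page(html, downloads, html_path):
    """Download every image, then write the rewritten HTML file."""
    out_dir = os.path.dirname(html_path) or "."
    for url, relative in downloads.items():
        target = os.path.join(out_dir, *relative.split("/"))
        os.makedirs(os.path.dirname(target), exist_ok=True)
        urlretrieve(url, target)
    with open(html_path, "w", encoding="utf-8") as out:
        out.write(html)


def main(argv):
    """Command-line entry point: expects exactly one URL argument."""
    if len(argv) != 2:
        print("usage: webdump.py URL", file=sys.stderr)
        return 2
    url = argv[1]
    with urlopen(url) as response:
        html = response.read().decode("utf-8", errors="replace")
    html_path = posixpath.basename(urlparse(url).path) or "index.html"
    new_html, downloads = localize_images(html, url, image_dir_for(html_path))
    save_page(new_html, downloads, html_path)
    return 0
```

To run this as a script, one would add a final line such as sys.exit(main(sys.argv)). Keeping the tag rewriting (localize_images) separate from the network access (save_page, main) makes the rewriting easy to test on a literal HTML string without downloading anything.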
Along with well-commented/documented source code file(s), your final submission should include a detailed README file describing:
- The operation of your program (from a user's perspective).
- What features your program supports.
- Known bugs or shortcomings.
- Challenges you faced in the course of the assignment and how you resolved them (or not).
Submit
Submit your Python script in the prog02 submit folder for this class.