Geocities is soon closing its doors forever. If you have a genealogy site on Geocities, you may want to read
- My Genealogy Site is on Geocities. Now What Do I Do? Part I
- My Genealogy Site is on Geocities. Now What Do I Do? Part II
- How to fnd genealogy on Geocities sites.
Those who want to save copies of their Geocities sites before it closes have a couple of choices:
- Copy each page manually using the Geocities File Manager
- Use a program that automatically copies the whole website to your hard drive.
Copying each page of a site individually might be OK for a small site, but for a larger site, it would be very tedious and extremely time consuming.
I decided to try the free http://www.httrack.com/ to copy a Geocities website. Httrack is meant to be used to copy sites to browse them off-line, but is also ideal for copying a site for which there is no FTP such as Geocities sites.
I downloaded the Windows version of httrack and tried it to copy a couple of my Geocities websites.
The first site I copied was small - about 25 pages. Hhtrack copied my site to my hard drive in only a few minutes. Note: Httrack copies the linked pages only, so that if you have some pages or graphics in the Geocities file manager that are not linked to other pages, they will have to be copied manually.
Then I tried a larger site that had over 700 pages. Httrack started copying this large Geocities site, but when it finished, I noticed I had many errors in the error log and saw that the whole site wasn't copied correctly.
Part way through the copying of the site, the httrack error log started to show the error message "Service Temporarily Unavailable" for each page. I realized that once Httrack reaches the Geocities' bandwidth restriction, it copies the error page that Geocities displays instead of the page on your website. Thinking that perhaps Httrack's "resume download" feature would resume the download from when I started getting error messages, I found I was wrong. Since it copied the whole site as it was displayed (albeit with many Geocities error pages noting that the bandwidth restriction had been reached), Httrack started to copy at the beginning instead of resuming and I quickly ran into the bandwidth restriction again.
Httrack does allow copying one directory at a time. I was able to copy directory by directory spaced out over time so as not to invoke the Geocities limit. Make sure each copy is at least an hour apart.
If all of your files are in one directory, you can move them into separate directories for the purpose of being automatically copied. Just click on the check box for each page you want to move into a directory, then select move to a directory.
Httrack does a good job of copying the linked pages of a website to your computer. However, you must manually copy anything from the file manager that is not linked.
And if you have a large website, Geocities bandwidth restriction will keep you from automatically copying the whole thing. Check the error log for problems and check the pages you copied onto your hard drive to make sure you have your website pages and NOT the Geocities error page. You don't want to find out too late that you haven't successfully copied your whole site and have a couple hundred pages of Geocities error pages instead of your site.