Tuesday, February 13, 2007

How to recover a website from the cache of Google, Internet Archive, Yahoo, MSN

I tried to change several websites by code.I meant to change some pages automatically for they had the same problem.But when the program finished to do so, it was not the result I expected. All the PHP files on the sever were changed to size 0.It's a bad day...Maybe it's something wrong with the server.I just called the server company and asked if they can recover all these size 0 files.They told me it's impposble, they did not have back-ups.What a bad day...bad severice...What should I do?I just thought something about google cache.All of these websites were indexed by Google, Internet Archive, Yahoo, MSN.Can I recover them from these caches?Really a good idea.But how to?I know it's possible to visit all the caches and download the cache files manually.But it's really a large project.I don't want to do this.Maybe I am very lazy.So I go to google for more information.Suddenly Google take me to Warrick, my savior, it's a perl program which is writen by Frank McCown. Soonly I learned how to use it and went to recover my site.The program is very powerful, but the recovery time may be a little long, but it's OK.Really Thanks Frank McCown, thanks very much for good research...

Have you met the same problem with me?Hope it will help you...Good Luck...

3 comments:

MaTT said...

Well this is not quite what you were looking for but its helpful.

If you did surf the site using Google Chrome you could use GoogleCacheViewer app and restore page by page directly from the cache.

If you are using linux (like me) the only option you have is to use that program with wine and restore de pages. This program works perfectly under wine with chrome 5 beta for linux, just point the cache to the correct place. Under ubuntu it will be ~.cache/google-chrome/Cache .

Signup Cafe24 said...

Yes! I finally found a guide to recover a website from the caches of search engines! Thank you for this! :)

Scott said...

Well, Signup Cafe24.... what is the "guide to recover a website from the caches of search engines" that you found? I am trying to recreate a complete website from 2002... it's part of my legacy and I would like to show my children what is was like back when!