[11:48] swebb: My bad, I misunderstood the output of your program
[11:49] e.g. for bit.ly/glyyh, which redirects to http://friends.myspace.com/index.cfm?fuseaction=profile.friendmoods&friendId=401903151&dateTime=308909160, which redirects to another URL and finally to http://www.myspace.com/error?ETOID=0&EC=404
[11:50] I expected the line in the output file to be: bit.ly/glyyh|http://friends.myspace.com/index.cfm?fuseaction=profile.friendmoods&friendId=401903151&dateTime=308909160
[11:50] But it actually is bit.ly/glyyh|http://www.myspace.com/error?ETOID=0&EC=404
[11:53] This unfortunately means that I can't directly import your file into the tinyarchive database
[11:53] But I will add it to the release as a separate file because it is still very useful
[11:53] Apropos release: I am currently running the database dump/release process on a server at home
[12:33] Oh snap, I broke the tracker
[12:36] oops :)
[16:00] soultcer: Hmm. I see that you're right, but it shouldn't be that way. I'll have to check things out.
[16:01] The example that I gave was not supposed to be transparently ignoring all intermediate redirects. It should be logging all 301s.
[16:12] Ok, I think that I found my bug. I can re-run everything and re-crawl/re-unwind everything, but it might take a (long) while since I've got like 15 months of data to re-crawl.
[16:13] Ouch
[16:14] Yup. Not a pleasant thing to do, but doable.
[16:14] yeah
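
For anyone reading this log later: the mismatch discussed above comes down to recording the last URL in a redirect chain instead of the first hop. The following is only a rough sketch of hop-by-hop unwinding, not the code either participant actually ran. The names `unwind`, `archive_line`, and `http_get_location` are made up for illustration, and the `short|target` output format is taken from the example lines quoted in the chat.

```python
import http.client
from typing import Callable, List, Optional
from urllib.parse import urljoin, urlsplit

REDIRECT_CODES = (301, 302, 303, 307, 308)


def http_get_location(url: str) -> Optional[str]:
    """Issue one HEAD request and return the Location header, if any.

    Hypothetical helper: it never follows the redirect itself, so the
    caller sees every hop. Scheme-less input is assumed to be plain http.
    """
    parts = urlsplit(url if "://" in url else "http://" + url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=10)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=10)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("HEAD", path)
        resp = conn.getresponse()
        if resp.status in REDIRECT_CODES:
            return resp.getheader("Location")
        return None
    finally:
        conn.close()


def unwind(url: str,
           get_location: Callable[[str], Optional[str]],
           max_hops: int = 10) -> List[str]:
    """Return the whole chain [url, hop 1, hop 2, ...], one hop at a time."""
    chain = [url]
    seen = {url}
    while len(chain) <= max_hops:
        location = get_location(chain[-1])
        if location is None:
            break  # not a redirect, so this is the end of the chain
        # Location may be relative, so resolve it against the current URL.
        next_url = urljoin(chain[-1], location)
        if next_url in seen:
            break  # redirect loop
        chain.append(next_url)
        seen.add(next_url)
    return chain


def archive_line(chain: List[str]) -> Optional[str]:
    """Format 'short|first target', or None if the URL did not redirect."""
    if len(chain) < 2:
        return None
    return f"{chain[0]}|{chain[1]}"
```

Calling `unwind("bit.ly/glyyh", http_get_location)` would keep every intermediate URL, so the importable line (`chain[0]|chain[1]`) and the final landing page (`chain[-1]`) can both be written out from a single crawl. Passing the lookup function in as a parameter is just a convenience so the chain logic can be exercised without network access.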