#archiveteam 2012-11-13,Tue

↑back Search

Time Nickname Message
09:47 🔗 Nemo_bis someone available to help me do a curl POST requests script?
09:49 🔗 Nemo_bis i'm probably doing something horribly wrong with urlencoding and other escaping but I might smash my head against it for two hours before I fix it myself
09:51 🔗 alard Nemo_bis: Yes.
09:51 🔗 Nemo_bis thans
12:26 🔗 alard SketchCow: Rough estimate of dailybooth = 2MB/ID * 2,000,000 IDs = 4.5TB.
12:27 🔗 alard Probably a bit more than that.
17:30 🔗 Lord_Nigh question that is actually on topic: when will archive.org/wayback machine use the robots.txt as existed at the time of the page archiving to decide whether to block access, as opposed to using the robots.txt of right now? there are several sites which had good info back in the day which is impossible to get because the current squatters have a 'block all' robots.txt
17:32 🔗 mistym Lord_Nigh: Yeah, that bugs me a lot too. It's the problem with taking domain name as the definitive identifier for a site, when the owner and identity of the domain itself can change.
17:33 🔗 Lord_Nigh i'm not sure if sedo marketing (a huge multimillion dollar squatting operation) does block all on all sites they have, but some other squatters do
17:49 🔗 Lord_Nigh heck there was even a legal case where the defense lawyers for a site which had posted some info changed the robots.txt DURING THE TRIAL. i gather the judge wasn't too happy about that.
17:50 🔗 Lord_Nigh hence i think the whole 'retcon-blocking' by changing current robots.txt could allow site suppression
21:28 🔗 Pepote WWW.JIZZDAY.COM
21:29 🔗 ersi Lord_Nigh: There's #internetarchive for repeating the robots.txt stuff of Wayback Machine
21:47 🔗 Lord_Nigh i'm in too many irc channels already
21:49 🔗 ersi Yeah, well I'm pretty tired of hearing the massive cuntflap around that particular feature in here
21:49 🔗 chronomex misfeature
21:49 🔗 ersi It's related to archiving, sure. It's not related to us, archiving
22:12 🔗 Lord_Nigh sorry.
22:57 🔗 joepie91 1TB seagate barracuda 7200rpm drives for $50: http://www.newegg.com/Product/Product.aspx?Item=N82E16822148697&nm_mc=AFC-C8Junction&cm_mmc=AFC-C8Junction-_-na-_-na-_-na&AID=10440897&PID=3668349&SID=
22:58 🔗 joepie91 you're welcome :)
23:53 🔗 swebb Whatever happened to the 2TB WD drives for $65? How come we don't see those anymore?

irclogger-viewer