[02:37] Excellent [14:52] seems that twitpic is gone [14:53] frontpage is empty [14:53] but, data still exists: http://twitpic.com/135xa [14:54] very odd [14:54] send warrior after areas that need grabbing [15:11] 503 Service Temporarily Unavailable [15:11] cc midas godane [15:11] (for the image URL) [15:12] hm? works here? [15:12] refresh it a couple of times [15:15] https://dn3pm25xmtlyu.cloudfront.net/photos/large/1827262.jpg?1232052212&Expires=1414336322&Signature=M3Dx66kXYdcgrTVlxqBCPqZ2osLKCwd4OZj84B3L0h~33bi~TCd8395uq5ImqqVCZfCqUUwdenExmfPACy1rUXX8lIxK2v8fyZMUt6r-kZMO3b039g1hBRBWy9BM-EzmwENCD7qf34fFLjrTylRqCpIHvyKEQ3453-jz7F38wNA_&Key-Pair-Id=APKAIYVGSUJFNRFZBBTA [15:21] Outlook cloudy. [15:25] i was hoping we could just attack the number.jpg ulrs [15:25] but no [15:25] you need special keys [15:25] that's the change Noah made once the 500m were grabbed [15:25] oh [15:25] sadly we didn't have the space for the last 300m [15:25] at that point of time when it was still unblocked [15:26] and because they announced no-close as well [15:26] there was no incentive to push for the last 300m [15:26] when they did announce re-close, too late, already locked the photos with key [15:26] and archiving had temporarily stopped because he said "Yay, Twitpic is acquired, you want to go skateboards?" [15:26] [This is how website owners speak, I imagine] [15:26] if we grabed the last 300m it would ahve probably been easier at this point really [15:27] in worse case we could have just redirected the base-36 URLs to base-10 image links [15:28] either way nothing more we can do [15:28] 500m is still a very good result [15:28] especially given the circumstances [15:28] let's just say if they didn't announce the acquire [15:28] we would have continued [15:29] and probably got close to current [15:29] Hindsight is always 20/20. [15:29] i blame it on the lack of storage [15:29] As you say, it was hard to make a case for the space once they said it wasn't being wiped [15:29] i was pushing 3-4gbps nonstop for the cloudfront project [15:30] i got the 500m in 3 days [15:30] even in the light of "won't be fooled again" there are always practical limits [15:30] I'd quote House, "Everybody lies" [15:30] Hehe. :) [15:31] All those weeny commenters on HN, all "they can't afford the transfer costs" and stuff. [15:31] You'd have thought we were taking a dying man's last dime. [15:31] This is starting to be something for #archiveteam-bs [15:31] the 500m would have cost them a few thousands i guess [15:32] It's a useful discussion for considering future AT policy and approaches to cases like these, though, maybe, ersi? [15:32] ersi: i think this is something everyone needs to learn about when archiving all future sites [15:32] but -bs is fine too [15:33] AT needs a firm solution for the storage issue [15:33] * antomatic nods [15:33] someone give me a bunch of twitpic URLs [15:33] ArchiveBot isn't banned and we have image access [15:33] weh? [15:33] just do it [15:34] yipdw: you can always do base36 conversion for 500m+ [15:34] Well, as long as it's not fretting about what we've lost for hundreds of lines and useless policy talk [15:34] yipdw: 49.213.23.196 [15:34] gah bad paste [15:34] Policy is, do (archiving) > talking [15:34] http://twitpic.com/89oqgx [15:35] ersi: yes but when storage is an issue, not possible to "do" [15:35] archiving without a plan is just masturbation. :) [15:35] watch http://archivebot.at.ninjawedding.org:4567/ [15:35] * antomatic watches [15:35] Sometimes, making a plan is when archiving doesn't happen and the building burn down [15:35] So it's not just masturbation. Whatever, I'm out :) [15:36] maybe I need to phantomjs that [15:36] huh yeah [15:37] so yeah, either keep talking about archiving or URLs [15:37] yipdw: it worked [15:37] you grabed the image fine, 200 [15:37] it's full of signed request shit, but yeah [15:37] well because it's a complete grab, the signing is fine [15:37] the html will link to the image [15:37] yeah, it'll just screw wayback up without some processing [15:37] oh yeah or that [15:37] ANYWAY [15:38] lets move this to #quitpic [15:38] ok [15:39] antomatic: thats why i create plans with my archives [15:39] looks at my funnyordie collection and you see the plan [15:39] you do great work, godane. that needs to be said. [15:39] always. [15:39] grab older stuff first [15:40] makes sense [15:40] my advices is more for the stuff thats not shutting down [15:41] cause otherwise you lose the stuff that more popular [15:43] also this at least give you some thing if they shutdown with no warning [15:44] with twitpic the best way of archiving that IA can handle is maybe grabbing the url images on the front pages that IA has [15:45] this way you give IA something and you have less dead links in wayback machiine [15:46] anyways i'm grab more global news videos [15:46] like News Hour and News Hour Final [16:33] Morning [16:33] This went to -bs quick [18:31] I am already running scripts on a couple of VPS boxes but when i try & run the scripts on a VM on my local LAN i am just getting Server returned 0 .........Sleeping [18:32] What am i doing wrong ? or not doing that i should be ? [18:34] does the vm have internet access? [18:34] yeah [19:07] TWITPIC "ACQUIRED" BY TWITTER. http://blog.twitpic.com/2014/10/twitpics-future/ [19:09] (the quotation marks mean "kind of") [19:10] its at least saved in a read-only state [19:10] but this one bothers me: You will still be able to login to your profile to delete content or delete your account on Twitpic.com [19:11] also twitter gives all there stuff older then 6 months to the library of congress i think [19:13] "Twitter shares our goal of protecting our users and this data." [19:13] hahahahahahahaha. [19:13] I don't think there's any sharing involved there... [19:21] godane: At the very least, if you send communication to Twitter saying "hey, we'd like to archive stuff!" they're likely to send a response. :P [19:23] pikhq: try to send actual communication to Twitter and then say that again :P [19:23] I didn't say it would be more than a formletter. [19:23] pikhq: nono, you misunderstand [19:23] the problem isn't in the response received [19:23] the problem is in actually sending them a communication [19:23] I've tried this before [19:24] They *do* have a mailing address, no? [19:24] after an hour I gave up on finding anything that wasn't an incredibly narrowly defined topic-specific contact form [19:24] pikhq: snail mail? possibly, I have no idea [19:24] certainly not any kind of online human support [19:26] Huh, impressive. Their website actually makes it hard to find contact info. [19:27] yep. [19:27] contract to Yahoo, the much-maligned destroyer of data, who have an actual online support department [19:27] contrast * [19:27] with a human responding [19:27] :P [19:27] anyway, this is going into -bs territory [19:33] "going" [19:33] shush :P [19:33] Dude, we passed WELCOME TO -BS, FIRST IN TANGENTS on the highway 3 hours ago [19:33] the siren didn't even go off yet! [19:57] SketchCow: i started grabbing News Hour Toronto [19:58] looks like they may have better archives then Global National archives [20:00] someone wants an off topic siren? [20:00] woop woop woop off-topic siren [20:00] oh, that was half an hour ago [20:09] Excellent [20:10] looks like i can get episodes going back to 2013 [20:17] SketchCow: there is also a english south korea news program called KBS News Today [20:17] there episodes of that on youtube [22:58] https://web.archive.org/web/20141007064755/http://www.genealogy.com/users/m/e/i/Paula-Arleen-Meinert/WEBSITE-0001/UHP-1583.html [22:58] https://web.archive.org/web/20141007063953/http://www.genealogy.com/users/m/e/i/Paula-Arleen-Meinert/WEBSITE-0001/UHP-Index.html [22:58] https://web.archive.org/web/20140929010450/http://www.mundia.com/be/Person/27883978/5094683264 [22:58] it's there [22:58] and it's awesome [22:59] :) [23:01] https://web.archive.org/web/20140924190350/http://www.mundia.com/af/Search/Results?surname=BAHA&birthPlace=Afghanistan