[00:04] *** xk_id has quit IRC (Read error: Operation timed out)
[00:16] *** mistym_ has joined #archiveteam
[00:18] *** Sk1d has quit IRC (Ping timeout: 606 seconds)
[00:21] *** mistym has quit IRC (Read error: Operation timed out)
[00:39] *** BlueMaxim has joined #archiveteam
[00:39] *** achip has joined #archiveteam
[00:59] *** mistym_ has quit IRC (Read error: Operation timed out)
[01:07] *** db48x has joined #archiveteam
[01:08] *** mistym has joined #archiveteam
[01:23] What where
[01:25] *** signius_ has quit IRC (Ping timeout: 512 seconds)
[01:27] *** mistym has quit IRC (Remote host closed the connection)
[01:34] *** signius_ has joined #archiveteam
[01:53] *** human39 has quit IRC (Leaving)
[02:18] *** achip has quit IRC (Remote host closed the connection)
[02:24] *** Ymgve has quit IRC ()
[02:30] *** kyan has quit IRC (Remote host closed the connection)
[02:57] *** phuzion has quit IRC (Read error: Connection reset by peer)
[03:01] *** phuzion has joined #archiveteam
[03:25] *** db48x has quit IRC (hub.efnet.us irc.Prison.NET)
[03:25] *** bsmith093 has quit IRC (hub.efnet.us irc.Prison.NET)
[03:25] *** abartov has quit IRC (hub.efnet.us irc.Prison.NET)
[03:32] *** db48x has joined #archiveteam
[03:32] *** bsmith093 has joined #archiveteam
[03:32] *** abartov has joined #archiveteam
[03:34] *** primus104 has quit IRC (Read error: Connection reset by peer)
[03:40] So, rapidshare
[03:56] yeah, i had a look on wikipedia for links that still work and none of them do
[03:59] there's links on the archiveteam wiki page
[03:59] i found them through a rapidshare search engine
[03:59] https://rapidshare.com/files/251393042/Google_Earth_pro_5.0.1337.rar
[04:00] https://rs602p2.rapidshare.com/cgi-bin/rsapi.cgi?sub=download&fileid=251393042&filename=Google_Earth_pro_5.0.1337.rar
[04:02] i found it through this search engine: http://rapid-search-engine.com
[04:04] we could try scraping it or asking its owners for the database
[04:07] looks like reddit also has its share of valid links (once you scroll past the shutdown notices): http://www.reddit.com/domain/rapidshare.com
[04:10] we could also scrape google with the keyword "intext:rapidshare.com/files"
[04:14] *** mistym has joined #archiveteam
[05:05] *** Spring has joined #archiveteam
[05:21] *** aaaaaaaaa has quit IRC (Leaving)
[06:35] reposting from the main rapidshare channel:
[06:35] "RapidShare claimed in 2009 to have 10 petabytes of files uploaded by users to its servers"
[06:35] that screams nightmare project
[06:36] so we'll probably have to do requests only like we did for twitch so that brewster doesn't get mad
[07:09] I'm pretty sure their retention has gotten worse since then
[07:36] *** mistym has quit IRC (Remote host closed the connection)
[08:51] indeed
[08:51] i'd be surprised if it's over 100TB
[09:04] *** Sk1d has joined #archiveteam
[09:07] *** primus104 has joined #archiveteam
[09:33] *** ohhdemgir has quit IRC (Read error: Operation timed out)
[09:40] i'm pretty sure they've had redesigns since that killed all old files
[09:41] plowdown would probably help though
[09:41] for downloads
[09:42] *** xk_id has joined #archiveteam
[09:47] *** xk_id_ has joined #archiveteam
[09:49] *** ohhdemgir has joined #archiveteam
[09:53] *** xk_id_ has quit IRC (Read error: Operation timed out)
[09:55] *** xk_id has quit IRC (Ping timeout: 512 seconds)
[10:08] *** xk_id has joined #archiveteam
[10:52] *** Muad-Dib has joined #archiveteam
[11:23] I misread "plowdown" as "pleclown" (a Wikimedia Commons admin)
[11:27] *** marvinw has quit IRC (Read error: Connection reset by peer)
[11:28] *** db48x has quit IRC (Read error: Connection reset by peer)
[11:32] *** schbirid has joined #archiveteam
[11:39] *** xk_id has quit IRC (Remote host closed the connection)
[11:39] *** xk_id has joined #archiveteam
[11:43] *** xk_id has quit IRC (Remote host closed the connection)
[11:43] *** xk_id has joined #archiveteam
[11:47] *** marvinw has joined #archiveteam
[11:49] *** xk_id has quit IRC (Read error: Connection reset by peer)
[11:50] *** xk_id has joined #archiveteam
[12:08] *** kumar has joined #archiveteam
[12:08] *** kumar has quit IRC (Client Quit)
[12:29] *** Ymgve has joined #archiveteam
[12:49] *** xk_id_ has joined #archiveteam
[12:49] *** xk_id has quit IRC (Read error: Connection reset by peer)
[12:54] *** xk_id_ has quit IRC (Remote host closed the connection)
[13:30] *** bauruine has quit IRC (Quit: ZNC - http://znc.in)
[13:30] *** bauruine has joined #archiveteam
[13:53] *** godane has quit IRC (Ping timeout: 248 seconds)
[13:56] *** sankin has joined #archiveteam
[14:02] #mediacrushed could use some people in there, just invited a found to help us out, it's only 2.7TB, join us
[14:06] *** godane has joined #archiveteam
[14:06] *** primus104 has quit IRC (Leaving.)
[14:25] *** xk_id has joined #archiveteam
[14:34] So, in terms of rapidshare
[14:35] My attitude is this
[14:35] There's a lot of crap, which we know. Videos, Audio, stuff elsewhere.
[14:35] What I'd like to focus on is maybe anything where we, for example, look through Github and find every rapidshare link
[14:38] *** sivoais has quit IRC (Ping timeout: 512 seconds)
[14:39] *** xk_id has quit IRC (Read error: Operation timed out)
[14:39] I realize it will be piecemeal, but we knew that
[14:41] *** sivoais has joined #archiveteam
[15:01] *** xk_id has joined #archiveteam
[15:05] SketchCow: there *is* a channel for that. #rapidscare
[15:14] *** mistym has joined #archiveteam
[15:32] Start: we can do it manually
[15:36] *** mistym has quit IRC (Remote host closed the connection)
[15:38] SketchCow: please join #mediacrushed
[15:38] SketchCow: are you able to make whole warc's with links unavailable for viewing in the wayback machine?
[15:45] *** K4k has joined #archiveteam
[15:46] *** rejon has joined #archiveteam
[15:46] *** rejon has quit IRC (Connection closed)
[15:47] *** rejon has joined #archiveteam
[15:52] Explain what you're trying to do.
[16:06] *** xk_id has quit IRC (Remote host closed the connection)
[16:07] *** xk_id has joined #archiveteam
[16:07] *** rejon has quit IRC (Ping timeout: 512 seconds)
[16:07] *** khaoohs has quit IRC (Read error: Connection reset by peer)
[16:11] *** khaoohs has joined #archiveteam
[16:12] *** xk_id has quit IRC (Read error: Operation timed out)
[16:16] *** mistym has joined #archiveteam
[16:19] *** xk_id has joined #archiveteam
[16:21] *** achip has joined #archiveteam
[16:31] *** primus104 has joined #archiveteam
[16:52] *** Start has quit IRC (Disconnected.)
[17:01] *** xk_id has quit IRC (Read error: Connection reset by peer)
[17:02] *** xk_id has joined #archiveteam
[17:15] *** aaaaaaaaa has joined #archiveteam
[17:35] *** achip has quit IRC ()
[17:41] *** Start has joined #archiveteam
[17:41] sent 906,748,810 bytes received 629,038,682,806 bytes 1,027,530.30 bytes/sec
[17:41] total size is 804,744,345,837 speedup is 1.28
[17:41] Now, that's a rsync
[17:41] *** Start has quit IRC (Read error: Connection reset by peer)
[17:43] *** Start has joined #archiveteam
[17:45] Sweet indeed
[17:45] Rather slow though?
[17:45] *** Start has quit IRC (Client Quit)
[17:46] Well, a lot had to happen.
[17:46] Nemo, was it you who sent me the Italian gamer magazines?
[17:46] SketchCow: more than one, actually
[17:46] You said they got rather wet
[17:47] They did
[17:47] But many didn't
[17:47] SketchCow: if you look for an email from federicoleva@, you should find an inventory
[17:47] Ok
[17:47] Anyway, a lot just went to Rochester, NY and the Strong Museum of Play
[17:47] Oh!
[17:47] Where they are to be catalogued, accessible to students and researchers, and possibly digitized
[17:48] Do they work with IA?
[17:49] * Nemo_bis skimming http://www.museumofplay.org/collections/library-collections
[17:49] They even have a blog post on the topic http://www.museumofplay.org/blog/chegheads/2009/08/why-collect-gaming-magazines/
[17:50] They'll be doing a press release on my donation
[17:50] "Historians 50, 100, 200, and even 500 years from now will be able to research and learn about the history of electronic games by studying these colorful periodicals." This sounds optimistic
[17:51] I'd be surprised if the paper of those magazines lasts more than a century
[17:51] Nice, I'll be curious to read it :)
[17:52] It's really wonderful that you found a home for your collection that suits your goals!
[17:53] Yes
[17:53] Paper does well
[17:53] The container etc. was nice but an existing institution which is really devoted to this stuff is much easier and more effective
[17:53] But the plan is to scan them
[17:56] institutional libraries generally rebind magazines in hardcovers with a year per volume which adds some protection
[17:57] Doesn't help much, if the paper itself "melts"
[17:57] yeah we didn't have magazine paper 500 years ago so I don't think anyone can say for sure what will happen to it
[18:01] With no offense to Jason's library, I admit that having a museum receive my 100 kg or so of paper makes me appreciate more the few hundred euros I spent on the effort
[18:01] https://www.flickr.com/photos/textfiles/sets/72157648458418953/
[18:01] Well, the cube was never an end point.
[18:01] People who thought or think that were deluded
[18:01] Sure, but I didn't know what the end point would be for sure
[18:02] I did
[18:02] I also like the IA physical library
[18:02] Oh, good :)
[18:02] (There's more photos, but the phone's still charging.)
[18:02] I didn't know you knew, but I was right trusting you!
[18:02] that's a lot of paper being moved
[18:03] shit's getting real when the pump truck comes out
[18:03] Especially across the ocean and with post services which like to leave paper under the rain or something
[18:04] Nemo_bis: you shipped 100kg of shit across an ocean?
[18:04] that couldn't have been cheap
[18:04] does strong take stuff like PC Magazine then or was that just being rearranged
[18:05] (talking of which, i liked that 50mhz FASTEST PCS EVER cover)
[18:08] PC Magazine was taken
[18:09] They have a lot of stuff, and they took stuff I had they didn't.
[18:09] They wanted PC Magazine. Didn't want Mondo 2000 or Wired. Wanted MAD Magazine. Didn't want Macworld
[18:11] *** mutoso_ has quit IRC (Ping timeout: 265 seconds)
[18:14] Adding 36 more photos, then sorting by time taken
[18:15] https://www.flickr.com/photos/textfiles/16324255088/in/set-72157648458418953 hah
[18:15] https://www.flickr.com/photos/textfiles/sets/72157648458418953/ is now in taken order
[18:15] that short-lived thing where you would print a book about the internet
[18:25] SketchCow: i'm grabbing pri the world wma's from wayback
[18:26] it looks like you guys have a good chunk of it between 2002 to 2005
[18:26] but we are missing so much of that podcast
[18:26] *** mistym has quit IRC (Remote host closed the connection)
[18:27] Hah, my mom would like this place too :) https://www.flickr.com/photos/textfiles/16511989005/in/set-72157648458418953
[18:32] *** cadbury_ has quit IRC (Quit: leaving)
[18:33] it's good somebody has a fix it felix for posterity, still needs dumping though
[18:36] *** mistym has joined #archiveteam
[18:37] SketchCow: if I have or find other stuff, should I ask them directly whether they are interested or tell you?
[18:37] I still have some bags in the cellar
[18:46] *** mistym has quit IRC (Remote host closed the connection)
[18:47] http://projectbluebook.theblackvault.com/
[18:47] *** mistym has joined #archiveteam
[18:48] *** Start has joined #archiveteam
[18:53] *** Emcy has quit IRC (Read error: Connection reset by peer)
[19:22] *** Start has quit IRC (Disconnected.)
[19:22] *** Start has joined #archiveteam
[19:31] *** Start has quit IRC (Read error: Connection reset by peer)
[19:31] *** Start has joined #archiveteam
[19:36] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer)
[19:36] *** aaaaaaaaa has joined #archiveteam
[19:37] *** Start has quit IRC (Disconnected.)
[19:45] *** mistym has quit IRC (Remote host closed the connection)
[19:52] *** Emcy has joined #archiveteam
[20:07] *** VonGuard_ has quit IRC (Read error: Connection reset by peer)
[20:10] *** signius_ has quit IRC (Ping timeout: 512 seconds)
[20:11] *** K4k has quit IRC (Read error: Operation timed out)
[20:16] *** VonGuard_ has joined #archiveteam
[20:23] *** signius has joined #archiveteam
[20:37] *** Start has joined #archiveteam
[21:12] Start: I think we can do testflight with archivebot
[21:13] what about the pages that require a login?
[21:13] *** xk_id_ has joined #archiveteam
[21:14] that can't be done with archivebot
[21:14] I'll get on it now
[21:14] hmm, I remember we had an account?
[21:15] *** mistym has joined #archiveteam
[21:16] *** xk_id has quit IRC (Read error: Operation timed out)
[21:19] i'll see if i wrote down the credentials anywhere
[21:20] *** xk_id_ has quit IRC (Remote host closed the connection)
[21:21] *** mistym has quit IRC (Remote host closed the connection)
[21:23] *** Start_ has joined #archiveteam
[21:23] *** Start has quit IRC (Read error: Connection reset by peer)
[21:23] *** Start_ is now known as Start
[21:25] *** mistym has joined #archiveteam
[21:45] *** schbirid has quit IRC (Quit: Leaving)
[21:54] chfoo: can you please add https://github.com/ArchiveTeam/testflight-grab to the projects.json?
[21:55] and add a FOS rsync?
[22:04] arkiver: ok, done
[22:05] *** sankin has quit IRC (Leaving.)
[22:09] thanks!
[22:14] *** wp494 has quit IRC (Ping timeout: 265 seconds)
[22:19] *** Start has quit IRC (Disconnected.)
[22:20] *** wp494 has joined #archiveteam
[22:20] *** wp494 has quit IRC (Excess Flood)
[22:21] *** wp494 has joined #archiveteam
[22:21] *** wp494 has quit IRC (Excess Flood)
[22:21] *** wp494 has joined #archiveteam
[22:26] *** Emcy_ has joined #archiveteam
[22:29] *** Emcy has quit IRC (Ping timeout: 265 seconds)
[22:42] *** primus105 has joined #archiveteam
[22:45] *** primus104 has quit IRC (Read error: Operation timed out)
[23:21] *** Start has joined #archiveteam
[23:53] *** Start has quit IRC (Disconnected.)