[01:32] SIDE ARCHIVING JOB [01:32] Should be a snap for someone. [01:32] http://groups.google.com/group/google-friends/browse_thread/thread/ce418fa005b18e2f?pli=1 [01:32] The newsletter it references. Let's get all issues. [01:33] http://www.google.com/googlefriends/archive.html [01:34] All formats, text and html [02:31] on it [02:34] Thanks [02:38] Why did we mirror thesheepmarket? [02:47] never heard of it [02:51] only 1001 more GV files to send [02:51] Are you still uploading? [02:51] Which directory. [02:54] gv_11 [03:02] I need to spend a couple more days compressing Friendster stuff [03:04] then I can upload that [03:04] I'd like to give you a different home to upload it. [03:04] We really need to get off that machine, they've been on me for a month [03:07] and it's got 'gv' in the name [03:07] archive.org is not one huge cloud huh? [03:08] depends on what level you look at it from [03:11] it is. [03:11] But I was given a specific machine to work with [03:12] With no use of the rest. [03:12] all 'clouds' are really just a lot of individual machines [03:12] Once I put it into the main site, it'll work. [03:12] And that's what I'm doing, I'm finally collapsing and collating the data [03:12] Friendster uploading will happen soon, etc. [03:14] My main function in a lot of these projects is taking the data from all the sources and turning them into something resembling a presentable archive. [03:18] hmm- my efforts to use the pcmcia floppy drive I had around are seriously hampered by the fact that only windows 3.1 can actually access the floppy drive [03:19] Just buy new old stock 5.25" [03:20] it was actually a 3.5'' drive- I lost my last 5.25'' drive 3-5 years ago or so [03:21] I know. [03:21] Replace it with a 5.25" [03:21] Upgrade backwards [03:21] The industry won't suspect that [03:21] instant $$$$$profit$$$$ [03:21] People are just throwing that shit away [03:21] You can make it $hit [03:28] yup, i pull 3.5 and 5.25 from every old machine i scrap [03:29] about every 10th one is not encased in dust lol [03:47] anyone here know if it's possible to view traffic levels on your PCIe bus? [03:49] i'm told the newest 'process monitor' from MS can do something like that [03:49] at least for PCIE GPUS [04:06] in my case I want to see how much is going to my SATA controllers [04:31] You might be able to do that with iotop [04:31] ^ db48x [04:52] hmm [04:59] lol [05:01] -rw-r--r--. 1 db48x db48x 0 Jul 29 21:31 google-friends-files.zip [05:01] -rw-r--r--. 1 db48x db48x 314 Jul 29 21:30 google-friends-pages.zip [05:01] [db48x@celebdil GoogleFriendsNewsletter]$ ll *zip [05:01] [db48x@celebdil GoogleFriendsNewsletter]$ unzip -l google-friends-pages.zip [05:01] Archive: google-friends-pages.zip [05:01] Length Date Time Name [05:01] --------- ---------- ----- ---- [05:01] 0 07-29-2011 21:30 google-friends/__welcome.txt [05:01] 0 07-29-2011 21:30 google-friends/index.txt [05:01] --------- ------- [05:01] 0 2 files [06:06] hrm [06:07] my wget-foo -sn't working [06:07] wget-fu [06:12] why isn't wget --warc-file GoogleFriendsNewsletter --warc-max-size 0 --mirror -S -E -k -K -p --protocol-directories -np --follow-ftp --progress=dot:binary http://groups.google.com/group/google-friends following the link to http://groups.google.com/group/google-friends/t/e0f51eca975dc9ff ? [06:32] you didn't turn off robots.txt exclusions? [06:32] (unless you have it in your .wgetrc file) [06:33] User-agent: * [06:33] Disallow: /groups [06:33] Disallow: /search [06:33] first three lines of a very long file [06:34] er [06:34] wait. you said /group not groups [06:35] HTTP request sent, awaiting response... 403 Forbidden [06:35] 2011-07-30 02:35:17 ERROR 403: Forbidden. [06:35] perhaps it tried and got the 403? [08:53] Coderjoe: it's turned off in my wgetrc and it still doesn't do it [15:20] * SketchCow is copying a LOT of friendster. [15:21] "Don't copy, that floppy!" [15:23] ------------------------------------------------------ [15:23] PHOTOS OF ARCHIVETEAM MEMBERS WELCOME FOR DEFCON PRESENTATION [15:23] MAIL PHOTOS, OBSCURED OR WEIRD ALLOWED, TO JASON@TEXTFILES.COM [15:23] ------------------------------------------------------ [16:04] SketchCow, can you rerun http://www.us.archive.org/log_show.php?task_id=81420067 ? [16:04] or kill it, whatever [16:10] Directory /28/items has only 364544 bytes free. This is dangerous as we don't want to corrupt your item! [16:10] FATAL ERROR: [16:10] An administrator will rerun your task once we have cleared space on this drive. [16:10] No error report is necessary. [16:17] aww [16:17] it's only 60 MiB or something [16:17] hm, *150 [18:59] http://torrentfreak.com/diglo-social-networking-for-avid-file-sharers-110729/ [19:00] "You rip shit off?" "yeah" "me too" "" [19:01] how easy would it be to dump hundreds of gaming videos ("frag videos", trickjumping and the like) to archive.org? would one need to write a description for each? is there some easy (cli?) frontend for mass uploads? [19:02] 1. Easy. 2. You should. 3. There's ways to do it. [19:02] How about you work with me on it. [19:03] For example, http://speeddemosarchive.com/ basically uses archive.org as their back end. [19:04] sweet [19:04] i'll collect and come back :) [19:05] Ah, the poor misunderstood TAS community~! I had wondered about how everything they do seemed to wind up on IA. [19:06] not TAS, those are usually non tas [19:07] Oh, right, the TAS people are on tasvideos. [19:28] There's Quake Done Quick and then there's the rest of the speed videos [19:29] Quake Done Quick With a Vengance, I have showed to rooms of people. It still reigns. [19:29] http://www.youtube.com/watch?v=OrkAuwaoFGg [19:30] i wish they would not use such ugly retexture remodel reugly mods on quake stuff so often [19:31] damn this video is pure genius [19:35] http://www.youtube.com/watch?v=-toKfJW6g8Q [19:35] The best 10 minutes of your life [19:36] When you see that guy solve one of the levels in 9 seconds, you want to throw out the bible and start praying to him. [19:36] stock glquake which looks bland and is missing graphical features from the software renderer :( [19:36] heh [19:36] i love the one that is featured on wikipedia too [19:36] where he grenadejumps off a monster that is just falling into a lava pit [19:37] oh fuck wikipedia [19:37] On it [19:37] Been doing it for years [19:39] ah, there it is http://en.wikipedia.org/wiki/File:Quake_e4m3route.ogg [19:39] i liked it before it was on wikipedia! [19:42] ... [19:42] http://www.gdcvault.com/play/1014853/GDC-2000-Keynote-Phil-Harrison [19:42] http://www.gdcvault.com/play/1014851/Console-Keynote-Changing-for-the [19:42] http://www.gdcvault.com/play/1014852/GDC-2000-Opening-Keynote-Bill [19:43] no flash no joy [19:43] Things that are not my problem, example #1 [19:43] :) [19:44] http://en.wikipedia.org/wiki/List_of_things_that_are_not_my_problem [19:44] #1. You can't install a plugin [19:44] but isnt it a weird feeling to do this? [19:44] #2. You are poor [19:44] #3. You are not being catered to [19:45] #4. You're not attractive [19:45] #6. My toast burned [19:47] if you got a moment please give me some pointers for easy archive.org uploading [19:47] Tell me what you're uploading first. [19:48] quakeworld/quake movies, starting with the collection of ftp://gamefiles.blueyonder.co.uk/blueyondergames/blueyondergames/trailers/movies/quakeworld/ [19:48] i'd unpack the archived ones and upload the contents for those [19:49] Are you sure they're not already uploaded? [19:52] some of them might be there already but the majority definitely is not [19:58] OK, well. [19:58] The short form is get an account, then get an s3 item [19:58] I mean auth [19:58] Then you can do a command line and shove items in [19:59] But really, take the time to do descriptions. Real ones. [19:59] Even if it takes weeks. [20:04] ^ [20:07] http://www.wired.com/epicenter/2011/07/undeletable-cookie/ [20:09] Old news, irrelevant to archive team, stop. [20:09] The machine I've copying friendster onto went down, waiting for return. [20:09] Trying to figure out what to do with this twaudio. [20:13] cheers [20:13] can i let other people edit descriptions? [20:13] There's no easy way to do that, but I will happily midwife being given alternate descriptions handed to me in a textfile that I plug in. [20:14] In the name of getting shit right. [20:22] that is some bloody awesome interface [20:22] i forgot about its exsitence [20:31] can anyone create meta collections? [20:35] nvm [20:43] hm, uploaded with --header 'x-archive-meta-mediatype:movies' but it ended up in Ebook and Texts Archive > Community Texts [20:43] i suck [20:44] Let me help. [20:45] You know, like I offered. [20:47] --header "authorization: LOW $accesskey:$secret" \ [20:47] --header 'x-archive-meta-mediatype:movies' \ [20:47] --header 'x-archive-meta-title:Ben plays piano.' \ [20:47] --header 'x-archive-meta01-collection:opensource_movies' \ [20:47] curl --location --header 'x-amz-auto-make-bucket:1' \ [20:47] --upload-file ben-2009-05-09.avi \ [20:47] http://s3.us.archive.org/ben-plays-piano/ben-plays-piano.avi [20:47] look at that. [20:57] well, i did not specify a meta collection because i can't find a list of them and it surely is not opensource content i upload [20:58] opensource_movies is their public movies collection. [21:00] ah, thanks [21:00] the url sure is misleading :) [21:00] oh good, only a 1000 files to upload [22:20] we've less than 5000 items in the queue [22:21] * Nemo_bis wants more Phil. Trans. [22:23] How about we don't rape the archive.org queue with what represents a denial of service attack. [22:23] Let the queue die down a tad, then go back in. [22:32] aww [22:32] those are quite fast to derive, btw [22:32] and there's no DoS that I'm aware of [22:33] there were 300 items in the queue last weekend [22:36] can one set lower deriving priority via s3? [22:47] psh, archive.org hq needs more heating [22:49] Buy them more servers? [23:09] *buy them older, less efficient servers [23:17] no, just keep them busy