[00:12] https://github.com/chao-mu/Exhumer/blob/master/modules/search/google.rb [00:13] 39G 2013.ftp.bpmmicro.com.zip [00:13] 55G ftp.bpmmicro.com [00:13] ah, the power of zip [00:57] Nemo_bis: that'll get you banned really quick [01:08] I gotta say - I'm really happy I'm going through this collection I got of 300 Commodore and general 8-bit books, and I'm finding maybe one out of 10 or 15 isn't one I already have up on archive.org. [01:10] :) [01:29] wow- that's impresssive [01:31] SketchCow: do we have the "basic computer games" and "more basic computer games" books by ahl digitized and up? [01:31] Well, I've REALLY been on this for a year plus now. [01:31] iirc theres an even older sold-by-dec version which predates the other published ones [01:32] I own a copy of "more basic computer games" [01:32] but not the first oen [01:32] sells for a bit on ebay last i checked [01:32] https://archive.org/details/Basic_Computer_Games_1978_David_Ahl [01:32] https://archive.org/details/bitsavers_decBooks10Mar75_26006648 [01:32] https://archive.org/details/Basic_Computer_Games_Volume_II_1980_David_Ahl [01:32] I CANNOT scan it right now due to idiot painters painting the window in front of the scanner and not cleaning it up properly so the scannr is moved away for a while [01:32] https://archive.org/details/More_BASIC_Computer_Games_1979_David_Ahl [01:33] https://archive.org/details/More_BASIC_Computer_Games_1979_David_Ahl [01:33] https://archive.org/details/More_BASIC_Computer_Games_1980_Creative_Computing [01:33] https://archive.org/details/Basic_Computer_Games_Microcomputer_Edition_1978_Creative_Computing [01:33] you think this is some motherfucking GAME [01:35] https://archive.org/details/More_BASIC_Computer_Games_1980_Creative_Computing <- that one should have "trs-80 edition" in its title [01:35] the 1979 one is the one i have [01:36] did you accidentally paste the 1979 link twice, or is there a difference between the two? [01:36] Accidentally. [01:36] and i already see some issues with the 1979 scan; the scan looks like it was auto-cropped to min size per page (which is kinda ugly); the cover is overly jpeg-compressed, and the very first page is completely missing [01:37] Huh, it's almost like someone did it for free [01:37] yeah. hmm [01:37] I'm trying to write a distributed web crawler… right now I'm using the S3-like API for Internet Archive uploads, but that has the unfortunate result of creating a lot of noise under my account. Is there a better way to go about this? [01:37] i'll scan the copy i have here at 800dpi when i get a chance [01:37] only the front, back and spine need to be in color, the rest is fine at 1-bit line art [01:38] kyan: you could always register a different account specifically for your S3 uploads [01:40] joepie91, Suppose so… things will still be getting tossed indiscriminately into the "texts" category though, oh well :( [01:40] thanks :) [01:40] https://archive.org/details/ftp.bpmmicro.com.2013.11 [01:42] I should make clear that as I'm stepping through these books, if I find that there is a radical difference in size between the two same-titled books, second title goes up anyway [01:43] If one if 605334 and the other is 605328 I don't. [01:43] But 605334 and 28993444, then I do [01:46] when did ISBN codes start showing up on the back cover of books? [01:46] kyan: if you tag them, SketchCow could probably toss them into the right collection afterwards [01:46] perhaps you could request a collection if you expect to be adding lots of data [01:47] :p [01:47] dashcloud: they've been on back covers of books (and first pages) in NL forever [01:47] often just above or below the barcode [01:47] joepie91, Right. I'll stick with tagging them for now (at least until I've actually gotten the code off the ground). :) [01:48] I have both the ahl books [01:48] alright :P [01:49] I scanned a little bit but stopped when I found it on another website [01:49] http://www.atariarchives.org/basicgames/ http://www.atariarchives.org/morebasicgames/ [01:51] I can slam through someone's stuff and shove the name around [01:52] kyan: if you're doing s3 uploads you should be able to set at least the mediatype [01:54] DFJustin, Oh! Thanks! I didn't realize there was a web mediatype, just looked at the list :) Thanks. [01:55] it'll still need to be put into an appropriate collection but then at least you get download links on the details page [01:56] "opensource_media" seems to be an open-access grab-bag collection [02:08] So, my plan for 2014 is intense fixing up of the collections. [02:11] Including making better tools for adding metadata and sorting things along with volunteers being able to assist more fervently. [02:11] And then expanding out in every direction with it, including better search and better interface. [02:11] So, you know, the usual shaking up [02:24] hooray for metadata [02:36] is there a way you can get more collection options available to people, even if it's just a suggestion to staff/curators/etc that an item should go a certain place or be marked as a certain type of item? [03:27] dashcloud: wouldn't think you'd notice much of performance difference between hypervisors running a warrior [03:28] thanks! [03:29] I was already running KVM on a headless server, so virtualbox was a bit hard for me [03:38] Having trouble finding documentation on scripting Heritrix3, doesn't seem to be in the manual… :( anyone know where it's hiding? [04:00] SketchCow: <3 [04:01] talk metadata to me [04:03] talk metadata to me [04:03] this should be in a topic. somewhere. anywhere. [04:08] http://www.flickr.com/photos/sarahseverson/6245395188/ [04:08] https://pbs.twimg.com/media/BKZqw-RCEAAWL1W.jpg:large [10:42] Uploading 139 gamebooks [10:42] https://archive.org/details/gamebooks [10:43] It'll fill out nicely. [11:03] uh 12k https://archive.org/details/64er_sonderheft_77 [11:19] man, I think haven't seen a gamebook in about 20 years. [11:37] We'll fix that [12:14] -------------------------------------------------- [12:14] HYVES IS NOW TOP PRIORITY - LESS THAN 8 DAYS LEFT [12:15] WE NEED TO DO 8 MILLION ACCOUNTS - WE HAVE 600,000 DONE [12:15] GO TO #ANGERTHEHYVES TO READ UP OR JUST GO TO THE [12:15] TRACKER AND AIM YOUR WARRIOR AT THE HYVES CONTENT GRAB [12:15] -------------------------------------------------- [12:17] Seriously, SketchCow taught me what I believe the only legit use of all caps across the whole Internet [12:19] I'm sure something like THE WORLD IS ABOUT TO END in serious context is probably worth it [12:20] yeah but that will be of use only in about 5 billion years or so [12:20] and I'm sure your heritage won't last that long, sorry [12:53] I'm even running a seasaw on this one! [15:52] damn Hyper-V doesn't support ova's I guess I do need to install VirtualBox... [15:53] do you guys archive url shorteners? [15:55] http://archiveteam.org/?title=URLTeam [16:24] someone should grab www.gamesniped.com .. owner has not posted in a month and is MIA to friends. Some pretty rare games have pics there, even one ebay auctions removes them [16:32] started an archivebot run [16:39] woo, got warrior loaded in Hyper-V, because I hate myself. [16:39] If it get it working I'll post up a blog post or something with instructions [16:49] AHh needed a legacy network adapter [16:49] then it worked great [16:54] where can I submit a feature request for the warrior, I want to run it at work but only at night. [17:11] https://archive.org/details/isohunt.teapot.2013 [17:11] https://archive.org/details/isohunt.coffeepot.2013 [17:12] https://archive.org/details/isohunt.croissant.2013 [17:28] "soHunt was" [17:33] which hyves to run? [17:33] discovery/content [17:33] 2 instances of content [17:34] Typo fixed. [17:34] never use virtualbox.. i have to import twice for two instances? [17:35] No, one warrior will do up to 6 at once [17:35] one vm, there's a setting on the dashboard for the number of concurrent items to run (2 by default) [17:35] but don't set it above 2 [17:35] oh, it seems i have two already [17:35] it makes 2 hdd images [17:35] Some people think they can help by upping it, but it doesn't work because Hyves just rate limits. [17:35] 60 gig is a pittance [17:37] ah.. it is in advanced settings [17:37] guessing this auto rsyncs? [17:37] Yes, you need not think about it again. [17:37] unless this Oracle product crashes my machine :) [17:40] 'rejecting possible infinite loop' is all i'm getting [17:42] it'll be fine [17:42] see, it's rejecting the loop [17:45] yay.. my name flashed by on leaderboard.. so something is working [17:46] what does hyves rate limit me to? i'm only getting 40K/S [17:51] because they hate everything good [17:52] how do i pause warrior if needed? [17:53] #warrior is really the best thing for this [17:59] close virtualbox and save state [18:04] SketchCow: I now have unlimited bandwidth at the house and, I know it's been a long time, but I still have the yahoo videos which I meant to get to sneakernet to you at HOPE/Defcon but never made it to either. Do you have an online repository I could offload to? [18:05] If I can offload those videos I could free up enough space to resume downloading new content [18:33] hello friends, has news of winamp.com's demise come to your attention? They have had forums for a long time but I don't know how far back they actually keep [18:34] shutoff date is dec. 20. [18:35] bananapwn: where has this been reported? [18:35] http://www.winamp.com/media-player/es [18:35] "Winamp.com and associated web services will no longer be available past December 20, 2013." [18:35] they have forums here: http://forums.winamp.com/ [18:36] "Additionally, Winamp Media players will no longer be available for download" [18:36] ...wow [18:36] thats some history gone right there imo [18:36] ! [18:36] Archivebooottttt [18:36] godane: http://forums.winamp.com/ is going down dec 20 [18:37] is shoutcast going away too? [18:37] that is a good question [18:37] I see nothing saying it is [18:38] lol/cry: http://forums.winamp.com/showthread.php?t=360236 [18:38] "No. Winamp & SHOUTcast are safe" [18:38] yeah right... [18:38] "winamp.com will be fine" (for a couple months) [18:42] http://www.archiveteam.org/index.php?title=AOL_Music [18:43] [18:43] The rumour is ShoutCAST and Winamp are safe for now. [18:44] that would seem to be a reference to this post: http://forums.winamp.com/showpost.php?p=2930593&postcount=2 [18:44] archive anyway ask questions later [18:44] the front page says unequivocally that it is going away dec20 anyhow [18:47] who's gonna whip the llama's ass now [18:47] that shit doesn't whip itself [19:13] Winamp.com and associated web services will no longer be available past December 20, 2013. Additionally, Winamp Media players will no longer be available for download. Please download the latest version before that date. See release notes for latest improvements to this last release. [19:13] Thanks for supporting the Winamp community for over 15 years. [19:14] http://archivebot.at.ninjawedding.org:4567/ [19:14] whoa, what happened? I loved winamp back in the day [19:15] they got bought out by aol ages ago and now it's bean counting time [19:21] thank you DFJustin [19:21] there's probably posts of 1998 me on that forum somewhere, maybe I'll see it again someday [19:27] does that mean shoutcast too? nothing on that side and the winamp one is vague. "associated web services?" [19:29] Better whip that llama's ass while we still can [19:47] Does it matter when I lose my connection during the Warrior's progress? [19:50] is there a channel specifically for Winamp that we should point people towards if they're willing to assist? [19:59] http://www.theverge.com/2013/11/20/5126666/winamp-media-player-shutting-down-after-over-15-years [19:59] Woo got a warrior running on windows 8 Hyper-V and got the scripts running on my home NAS machine [20:01] i wonder what the effects are on shoutcast and the shoutcast website [21:38] I suppose someone could ask on the winamp forums if it's still possible to sign up [21:49] eprillios: it'll keep trying to download/upload until you're connected again. [22:20] So, with WinAmp shutting down and all, I wanted to know if you'd archive all the plugins and skins. We have 15 years of history at stake! [22:23] archivebot is on the job [22:38] Archivebot? [22:41] a bot that archives websites [22:41] #archivebot [22:42] and the webinterface of it (where you see doing it stugg) is at http://archivebot.at.ninjawedding.org:4567/ [22:42] stuff* [22:43] cool [22:50] well, I have to go now [22:50] it's getting late here [22:50] see you guys tomorrow