[00:12] i'm mirroring more m.wsj.net/videos/ urls [01:47] SketchCow: you really need to fix this: https://archive.org/details/wsjtecham [01:47] its meant to be a collection not a item [01:48] also getting access to both wjstecham and wjstechpm collections would be great [01:50] i'm pushing more of it to godaneinbox for right now [02:03] DFJustin, dumped what I had of yandre, going at it with a little more tact, otherwise the work i'd have to do it post is going to take forever [02:39] you might also consider http://archiveteam.org/index.php?title=Wget_with_WARC_output [02:40] (ohhdemgir) [02:40] will do that later, just getting all original images for now [03:26] i'm grabbing the cbsradio hourly and update mp3s [03:27] it goes back to at least 2009 [03:27] its most likely going to be like my cbs video dumps [03:28] where i put every mp3 of that of one day into one item [03:30] items are mostly likely going to be like cbsradio-2009-01-01-Hourly and cbsradio-2009-01-01-Update [03:31] cause each hour has a hourly and update mp3 [03:34] going my through my bookmarks, I found this one: http://matkelly.com/wail/ someone else's earlier attempt at an archivebot-like program on a smaller scale [03:35] >Linux (coming soon) [03:35] :( [03:43] that's the sad part [03:44] there appears to be nothing in WAIL that is platform-dependent and not working on Linux [03:45] oh, path manipulation [03:45] that's not that bad :P [03:46] I'd say ArchiveBot is a lot simpler [03:46] despite how many programs and languages are involved in it [03:58] Result: [06:30] I'm starting to upload World News Today episodes i got from abcnews.go.com website [06:31] https://archive.org/details/abcnews-now-world-news-tonight-2006-07-03 [06:32] i'm calling it abcnews now edition cause alot of abc videos say that at the beginning of the web videos [06:34] regarding the Law and Order computer shots. at the github page [06:34] Why aren't the screenshots here? [06:34] Because there are 11,000 of them, and they're copyright NBC/Universal. If you'd like access to the archive for research purposes, please just email me. [06:47] arkhive, I read an article on that not so long ago... that was you? :O [07:09] arkhive: well, we got a copy -> http://archivebot.at.ninjawedding.org:4567/#/histories/http://computersonlawandorder.tumblr.com/ [07:10] not 11,000, but hey [07:10] i guess if the guy ever puts up more we'll just have to get them too [08:13] Could we redo tivo community? [08:13] Either through bot or through something else [09:25] SketchCow: archivebot job submitted; I also manipulated the job routing so that tivocommunity's sitting on an EC2 m1.medium [09:25] hopefully that will be enough RAM [09:26] that said, the last grab isn't that old: http://archivebot.at.ninjawedding.org:4567/#/histories/http://www.tivocommunity.com/ [09:26] though I suppose it did have a lot of errors [09:42] Good. [09:42] Itwas deleted. [09:42] I had to delete about 5-6 of the aborts to make it not kill internet archive. [09:48] so, some of the archives can actually kill the wayback machine? [10:37] lol [10:37] we're killing the system that is archiving the entire internet? [10:37] I'd love to know how that works [10:37] :D [10:56] No, no. [10:57] Not killing the wayback machine, erroring out on the scripts that try to analyze the thing for WARC readability. [10:57] So it never gets on stage [10:57] And the ingestion stops dead. [12:40] still, lets not break that bit ;-) [12:56] hmm. [12:56] I don't think CDs are water-resistant [12:56] err, DVDs [12:57] * joepie91 stares sadly at his Age of Mythology DVD [13:13] depends on the duration of contact with water [14:57] joepie91: AoM came out on DVD? When? My copies are CDs [15:00] turnip; budget bin combo edition [15:01] at least, that's what I recall [15:02] Yeah, I had the Gold Edition, which put it and the expansion on seperate CDs [15:02] turnip: http://i.imgur.com/0MCl7YX.jpg [15:02] it's not much easier to read in IRL than it is on the picture, by the way [15:02] the pseudo-holographic effect looks really cool [15:02] Wait, the fuck I mean "had"? I still have and play that jazz [15:02] but doesn't exactly help readability, lol [15:03] turnip: heh [15:03] but yeah [15:03] it has a ubisoft logo on it [15:03] for whatever reason [15:03] and it's definitely a DVD :P [15:03] That makes little to no sense, Ubi didn't make AoM [15:03] What creepy chinese convenience store did you get that from? [15:04] my Civilization IV disk similarly fails to be recognized, but there's no visible water damage and no serious scratch/foil damage, so not sure how that works :/ [15:04] turnip: heh [15:04] its absolutely a genuine disk [15:04] but it was from the budget bin at a local games + media store [15:04] probably some stupid re-publish scheme [15:04] where ubisoft buys bulk republishing rights for X time to try and squeeze the last money out of it [15:04] or whatever [15:04] Would make sense, after Ensemble closed [15:05] I've run across a few other things with a Ubisoft logo on it, where I just went "wait, what? the fuck does Ubisoft have to do with this game?" [15:05] But then I on't think you could put their logo on it [15:05] turnip: ensemble is still listed [15:05] see the picture [15:05] so they probably just added themselves as publisher [15:06] Modified: Today [15:06] GOSH THANKS THUNAR [15:06] how very detailed of you [15:07] * joepie91 digs into Brasero source [15:14] joepie91: GLaDOS: renew archivingyoursh.it [15:30] lol [16:49] well then [16:49] I've automated archiving CD-ROMs now [16:50] http://sprunge.us/fQTb?py [16:50] expects cdrdao to be installed [16:50] and relies on udisksctl for user-level unmounting, so probably needs GNOME or whatever too [17:00] should probably use some checksumming to verify rips? [17:00] and warn when there are audio tracks or other weird tracks [17:04] third dead disc D: [17:04] Schbirid: it should in theory image all tracks [17:05] also not sure how exactly to do checksumming, but afaik CD-ROMs have error correction (and the corresponding "read error" aborts) anyway... [17:05] unless I'm missing something [17:09] also, Schbirid, I should point out that I'm just replicating the Brasero imaging process [17:09] with this script [17:10] I do not like the sounds coming from my scanner [17:11] well for "proper" dumping i always look at http://redump.org/guide/cddumping/ , sigh and lock myself in a box [17:12] by proper i mean when i want to make archival grade images [17:17] midas: http://imgur.com/a/TqWVu [17:17] (cc Schbirid) [17:17] an excerpt of the stuff I'm imaging [17:18] :D [17:18] Schbirid: these are CD dump [17:18] dumps * [17:18] er [17:18] these are CD-ROM dumps [17:18] the guide is for CD dumps [17:18] CD dumps are done with RubyRipper and cdparanoia [17:18] with checks and evrything [17:18] :P [17:18] nice [17:18] afaik CD-ROMs do not require a separate verification due to their error correction mechanism [17:18] i have so many old mag discs :( [17:18] Schbirid: but? [17:18] wouldn't bet on that! [17:19] but there were many with mixed tracks, many dvds, little time and motivation [17:19] plus hallfiry has them all now anyways :) [17:19] Schbirid: what's the problem with having a lot of old mag discs [17:20] that's a great thing? [17:20] :P [17:20] clutter :( [17:22] my neighbour's router does not have good ad filtering rules [17:23] lol [17:23] on that note [17:23] if any NL peoples have old CDs/DVDs whatever [17:23] I'd gladly have them sent here, and archive them :D [17:23] rofl [17:24] * joepie91 ran out of CDs to scan [17:24] my archiving processes are starting to become disturbingly efficient... [17:26] did I mention how awesome gThumb is for post-processing scans of things that are not books or magazines? [17:26] it has a VERY efficient rotate tool [17:27] you pick 'parallel' or 'perpendicular', draw a line over a straight edge on the image (it's a several-pixels-thick color-negating line, so very easy to align) [17:27] and it rotates the entire image along that axis [17:27] virtually every CD label, for example, has a straight edge or straight-edged font somewhere on it [17:28] nice [17:28] that said, I'm going to have to rescan a few things when I get a new scanner... [17:30] bye [17:32] I usually do that with the ruler tool in photoshop [17:33] DFJustin: photoshops interface is not very suited towards batch work [17:33] gthumb's is :) [17:35] cool will have to check it out [17:36] I got a whole stack of shareware cds from my last thrift store visit [17:37] DFJustin: I've been considering checking out the thrift store here [17:37] but I don't exactly have any money to spend on it atm [17:37] :P [17:37] also been considering putting out an ad in a few (free) places for "send me your old CDs/DVDs" [17:38] well they're usually $1-2 apiece so it's not a real budget buster [17:41] one disc has been throwing errors... [17:41] DFJustin: $1-$2 per CD is a -lot- in my case [17:41] that's one dinner per CD [17:49] :o [17:49] such annoying, can't find my labels... [17:50] DFJustin: I live off very little money :P [17:50] ahh, found labels [17:55] joepie91: but you do what you like the most, archiving! (I think) [18:07] arkiver: I don't get paid for archiving :P [18:07] my low income has more to do with my unwillingness to write proprietary software [18:07] no I mean you don't make a lot of money, but you do what you like to do [18:08] eh [18:08] to a degree [18:08] not that you make a little money with archiving... :P [18:08] there are things I'd change, that I cannot currently change [18:27] Raw sense data: 0x70 0x00 0x04 0x00 0x00 0x00 0x00 0x0a 0x00 0x00 [18:27] SCSI command failed: sense key: 0x04: Hardware Error [18:27] 0x00 0x00 0x3e 0x02 0x00 0x00 [18:27] well, that's a new one [18:29] yay, a multi-track disc [18:29] the first one! [18:29] Railroad Tycoon II... [19:02] https://archive.org/details/ABNAmroEurostyleEJayMijnEigenDagboek [19:02] whoop whoop [19:35] oooo [19:35] bank cd [19:37] Flung it over to cdbbs [19:37] We will definitely need to regard the cdrom subcollections and things. [19:51] I've been doing some housekeeping but I don't know what your organizational vision is [19:53] Something akin to that storage container I imagine [19:54] * joepie91 pushes 12mbps to IA [19:54] SketchCow: I'm currently batch-uploading disc images [19:55] that is, bin/cue images plus CD label scans of a lot of original CD-ROMs [19:55] including a good amount of drivers, Dutch stuff, and otherwise unusual CD-ROMs [19:55] if possible, having an iso too is nice [19:55] that you're unlikely to find anywhere else :P [19:55] for browsability [19:55] DFJustin: might consider doing that later [19:56] currently prioritizing having a solid and complete all-tracks image online [19:56] before the CD-ROM dies altogether [19:56] these are all fairly old discs [19:56] 2000-2006 [19:56] I think there should be a separate driver discs collection, there are already a number of random ones in a couple places [19:56] so I'd rather not bet on their continued working-ness :) [19:56] DFJustin: I tag all my driver CD uploads with 'drivers' [19:57] perfect [19:57] also, I am starting to think that the few CD-ROMs that failed to be recognized... have some sort of weird copy protection [19:57] because they're all from similar publishers [19:58] that, or they all use a poor disc pressing mechanism [19:58] since it's also all budget discs [19:58] budget bin discs * [19:58] we only have a couple dutch cds it seems so more coverage of that area is great [19:58] DFJustin: yeah, as I mentioned before I'm actually considering putting out ads to collect old CDs/DVDs from NL people [19:58] there's likely to be a good bit of Dutch culture in there [19:59] https://archive.org/search.php?query=%28collection%3Acdbbsarchive%20OR%20collection%3Acoverdiscs%29%20AND%20language%3A%22Dut%22 [20:00] also, SketchCow, please tell whoever put together the HTML5 uploader for IA that I <3 their "network problem" resume [20:00] IA upload is about the only thing that doesn't roll over and die with my current internet connectivity [20:00] heh [20:01] DFJustin: haha, two of those are mine [20:01] a third in there isn't mine, but I have a disc from the same series laying around [20:01] not an original though [20:05] I also have a lot of budget bin stuff [20:05] such as https://archive.org/details/AirportTycoon2 [20:25] I am completely forgetting who wrote that. [20:26] ∏ [20:26] err [20:34] * joepie91 stares at his first stack of 24 scanned, imaged and labelled discs [20:34] archiving is fairly labour-intensive [20:35] uploaded: https://archive.org/details/ASRockAv27b [20:35] uploaded: https://archive.org/details/CasioFx9860G [20:35] uploaded: https://archive.org/details/ConstructionDestruction [20:36] :D [20:36] uploaded: https://archive.org/details/AsterixEnObelix [20:36] wow [20:36] thank you so much joepie!! [20:43] so apparently Cryo released their SCOL stuff as open-source (though I can't find it) [20:43] and Windward released Enemy Nations as open-source [20:43] never realized quite so many open-source releases after bankrupcy happened in the games world... [20:49] uploaded: https://archive.org/details/Creatures2Lifekit [20:49] uploaded: https://archive.org/details/ATICatalyst [20:51] ha, cant believe that itendifier was still available :D [20:51] if you want to go archiving crazy, most (all?) printed CDs have some strings in the middle. some etched into the plastic, some other in the silver [20:52] wow creatures 2. [20:53] * joepie91 is uploading a Freddi Fish [20:53] Schbirid: haha, wow, now that you mention it [20:53] sorryryyyy .P [20:53] creatures 2 was fucking awesome :D <3 [20:53] oh you meant Smiley [20:53] Schbirid: yeah, I might take note of those later [20:53] <£ [20:53] those strings aren't prone to DEATH AND SUFFERING so it's not a very urgent issue [20:53] my cyrix 333 couldn't keep up with how many i had [20:54] seeing as I keep a physical copy of all discs [20:54] far too good at breeding xD [20:54] Smiley: haha [20:54] seriously though [20:54] that was an impressive feat of gamedev [20:54] with the whole behaviour development thing [20:54] yup [20:54] bohren und der club of gore is so relaxing https://www.youtube.com/watch?v=yiBsJPEDJeg [20:55] god i'm so overpowered in this game now hahaha [21:00] I need to start a few more IA upload jobs [21:00] brb [21:02] nighty [21:02] uploaded: https://archive.org/details/ATIDrivers8.08 [21:11] I can't seem to push more than 12mbit to IA :/ [21:20] that's more than I can do... [21:38] uploaded: https://archive.org/details/EgypteGameNL [21:38] uploaded: https://archive.org/details/DeGroteBosatlas20062007 [21:38] DFJustin: I'm on (supposedly) 100mbit ftth [21:38] :P [21:38] with practical 55mbit up [21:39] http://boingboing.net/2004/02/05/worst-tos-on-the-ent.html [21:40] uploaded: https://archive.org/details/FreddiFishWaterwerken [21:43] Oh damn, Freddie Fish [21:43] Now get some Pajama Sam up in here [21:43] uploaded: https://archive.org/details/EpsonHarryPotterPrintStudio [21:43] turnip: don't have those :( [21:43] I do have one of the real freddi fish games though! [21:45] uploaded: https://archive.org/details/KorpsMariniersScreengamer [21:45] (yes, I'm on a spree) [21:45] I have a bunch of Humongous Entertainment games, I should uplaod those [21:45] turnip: YES [21:45] Putt Putt, Fatty Bear, Pajama Sam [21:46] oh man [21:46] putt putt [21:46] never heard of fatty bear though... [21:46] but putt putt and pajama sam! [21:46] man [21:46] https://ia600708.us.archive.org/20/items/staff2011/staff2011-2.jpg needs to be updated [21:46] I remember playing those demos [21:46] over and over again [21:50] uploaded: https://archive.org/details/EnemyNations [21:52] uploaded: https://archive.org/details/MultimediaPool [21:57] http://technologyadvice.com/iron-mountain-data-warehouse-burns-buenos-aires/ [21:57] OMG [21:57] turnip [21:57] Spy Fox! [21:57] uploaded: https://archive.org/details/FXFighterTurbo [21:59] (SketchCow, you reading? :P) [22:00] there's not really a good collection yet for commercial software discs [22:00] ivan`: very odd [22:01] someone might have wanted some old ownership claims gone [22:03] I'm idly watching [22:05] :) [22:06] ivan`: yeah, that was my impression [22:06] uploaded: https://archive.org/details/Moordspel [22:08] whoop, all my upload slots are busy [22:08] time to sort out some images again [22:36] joepie91: the best site for keeping track of games that have released their source code is probably: http://liberatedgames.com/ [22:37] so i just got an email [22:37] Hi Viddlers, [22:37] In 2006, Viddler’s founding business model was based on the creation of a community site for video enthusiasts and personal sharing. At the time, our business revenue model was driven through advertising. As a Viddler community user, you were a part of this model. As time has passed Viddler is no longer able to support this offering and business model. [22:37] Therefore we’ve made the decision to close our free site and community effective March 11th, 2014. [22:39] uploaded: https://archive.org/details/LectoriSalutemBoekenweek2000 [22:40] uploaded: https://archive.org/details/PackardBellFreddiFishRobotSpot (cc turnip) [22:40] dashcloud: thanks! I had no idea that existed [22:40] joepie91: Yeah I never had any Spy Fox games [22:41] incog: uh oh [22:41] (turnip: neither did I, but I loved the demos) [22:44] first 39 archived discs filed away! [22:44] whoo! [22:44] uploaded: https://archive.org/details/PackardBellActuaSoccer2 [22:53] uploaded: https://archive.org/details/OnTrackVWO1 [22:54] uploaded: https://archive.org/details/RacingSimulation2 [22:54] uploaded: https://archive.org/details/MystMasterpieceEdition [22:54] now, time to sleep :P [22:59] (uploaded: https://archive.org/details/PackardBellEncarta98) [23:10] ooh, I just saw this [23:10] http://theflashbulb.bandcamp.com/album/hardscrabble [23:11] or rather, heard, I guess