[00:52] Kevin Pereira's Imaginary Friend: https://archive.org/details/g4tv.com-video39529 [01:11] my cpu temp is at 64.5 C [01:12] never seen that before compiling firefox [01:50] godane: well hey, it's called *fire*fox [01:50] :P [01:59] its made to kill cpus then [04:22] so i found a very old websites about laser discs and stuff [04:22] Grab that shit [04:23] i'm mirroring could i want to see if i can past the 3585 files in wayback machine [04:24] this is the website: http://www.blam1.com/ [04:24] best to have a stand alone archive of it [04:27] it has bumpers of DiscoVision [04:27] in real media format [04:28] i think this was a database of laser discs [04:28] with reviews [04:29] niiice [04:29] lddb.com is another I think [04:32] looks like lddb.com was japanld.free.fr [04:33] http://web.archive.org/web/20060114075257/http://japanld.free.fr/ [04:33] that one doesn't existed anymore [04:34] must have been a old redirect since it had lddb.com in the page [04:34] the best part of old websites is that everything is on one domain [04:35] not freaking youtube redirect [04:35] no weird comments hosted on other sites [05:05] its over 70mb now [05:06] also i have past 3585 files in wayback machine [05:06] i'm at 4746 now [05:25] ok so its done [05:25] 5628 files in warc.gz [05:37] uploaded: https://archive.org/details/www.blam1.com-20130323 [05:43] its called the Blem Entertainment Group [05:49] so i found that discovision.com website is still alive [05:49] grabing it [05:49] lets see if it bets the 301 total urls in wayback machine [05:49] typo, should be Blam Entertainment Group [05:51] fixed [06:09] it only has 83 files [06:09] discovision.com that is [06:16] also know that blamld.com and blam1.com are the same [06:17] from what i could tell they bought blam1.com [06:17] maybe to stop a porn site or something [06:18] anyways even wayback doesn't have all the files under blamld.com host [06:29] uploaded: https://archive.org/details/www.discovision.com-20130323 [06:34] i'm now grabing cedmagic.com [06:37] its about this: http://en.wikipedia.org/wiki/Capacitance_Electronic_Disc [06:38] ceds are cool [09:43] kennethre: you around? [09:44] Nevermind! [12:09] hi there! [12:09] Konnichiwa [12:12] i'd like to dip into some of the archived bt internet dialup (http://archive.org/details/archiveteam-btinternet) stuff [12:13] i've obtained hanzo warc-tools, grepped thru the CDX files for stuff i'd like to get, and now I think I have some byte offsets for specific spots in specific warc files with the files I'd like to dip into [12:14] i don't fancy downloading the entire eleventy-billion gigabytes of warc files see ;o) [12:14] I think the IA servers support range requests [12:16] I'm struggling to see how to download specific parts of warc files - on a semi-automated basis - so I can unpack the files I'd like to see to my disk [12:17] I'm very new to the warc format and tools for working with it - do you guys know if there's a part of warc-tools (or some other nifty warc-friendly tool) which will do what I want? [12:19] I don't know about warc-tools, but basically you need to make a http request (be it with python's urllib or with curl) that tells the server to only return a specific range of bytes [12:20] hcurl -L -r 2000-5000 http://archive.org/download/archiveteam-btinternet-u-z/btinternet-u-z.megawarc.warc.gz > extract.warc.gz will fetch only bytes 2000-5000 from the given file [12:20] I think I can use wget or curl to specify a specific byte range to download, but I have a hunch I'll end up with just some data with no context, certainly not a valid warc which I can parse and extract data from? [12:21] ah. whoops - I was typing whilst you were answering. ;o) [12:21] A warc.gz file is basically a succession of warc records each individually gzipped, and then concatenated [12:21] As long as you start at the correct offset, it should work [12:21] oho, awesome sauce! [12:22] i'll give this a go and report back - thanks soultcer! [13:45] Here, have some light (20k words) reading of tech support stories http://www.reddit.com/user/jon6/submitted/ [13:45] There is great rage to be had [13:46] (despite the naming similarities it is different to BOFH) [13:51] similarly r/talesfromtechsupport [13:52] and r/cablefail [13:52] well, they are all submitted there, his user page is just a nice portal to list them all [13:57] hey everyone [13:57] i had to restart my cedmagic.com download [13:58] luckly i was only at 12mb and i just past that with out any long wait [13:58] my wifi droped in my sleep is the reason [13:59] so any, any of you know how to set up an EC2 instance with a GPU? [14:01] nope [14:01] they're not even on the damn lsits. [14:02] is there anywhere that WOULD know? [14:05] i found 10mins of news coverage [14:05] its from good day oregon [14:06] * nwh twitches [14:11] the video was with the guy that owns cedmagic.com [14:37] i'm past the number of files on wayback machine for cedmagic.com [15:48] is there a way to stop multiable / urls from downloading [15:53] i will see if adding /// to reject-regex works [15:54] Ah, you mean URLs which have multiple "/" in them [15:55] yes [15:55] I know heritrix has a filter for that, but I don't know anything for wget [15:55] it has reject-regex [18:12] GLaDOS: what's up? [19:00] kennethre: I think GLaDOS wanted to ask you about the ArchiveTeam warrior buildpack. The Python buildpack failed because of this https://github.com/heroku/heroku-buildpack-python/issues/79 [19:00] alard: ah well my response is the proper answer :) [19:00] But that's fixed now that the AT buildpack uses the latest Python-buildpack tag. [19:00] excellent [19:00] So I think GLaDOS is running one Yahoo Messages instance on Heroku now. [19:00] awesome [19:01] i was going to run some [19:01] soon [19:02] Cool. There's a strong competition this time. [21:22] http://i.imgur.com/z0R4kXI.jpg [21:22] lul wut [22:05] fuck knows [22:05] "i think i'm cool because i charged someone $24 for a dongle" ? [22:07] I was just thinking of the PyCon debacle the whole time [22:08] ersi: that too [22:16] this movie is kinda dope [22:16] Will Ferrel, time travel and dinosaurs - do I need to say more? [22:29] https://www.youtube.com/user/ISO8 who likes trains? ;) [22:29] I'm running low on disk after 422GB of k-pop [22:29] oooh, k-pop [22:30] hey! I've been on that user and watched some videos before [22:30] that was https://www.youtube.com/user/godmd6 which I have 1 copy of [22:30] there are at least two great cab view videos in ISO8 [22:31] https://www.youtube.com/watch?v=632rDJGrH1M https://www.youtube.com/watch?v=cW7IdpV49h0 [22:34] more, actually [22:45] huh, Jason Segal was in Slackers [22:53] I'm running low on disk after 422GB of k-pop [22:53] someone I know would virtually orgasm if he read this