[00:00] I think I should do it, more I think of it. [00:00] It's my fight. [00:01] ok [00:01] i have no idea how to that anyways [00:02] i'm build phantomjs to grab hackforums.net [00:02] it needs javascript in order to download the threads [00:16] *** RichardG has quit IRC (Read error: Connection reset by peer) [00:16] I'm wgetting all the forum posts and am running analytics against it. [00:16] We have some incredibly rude, incredibly vicious people attacking the design change. [00:16] I want to see how many they are. [00:17] *** RichardG has joined #archiveteam-bs [00:19] *** dashcloud has quit IRC (Read error: Operation timed out) [00:25] *** dashcloud has joined #archiveteam-bs [00:27] *** Start has joined #archiveteam-bs [00:32] *** mistym has quit IRC (Remote host closed the connection) [00:52] I got curious. Thanks for immunizing me against the forums again. [00:55] I'm amazed IA keeps the forums up [00:56] *** dashcloud has quit IRC (Read error: Operation timed out) [00:56] also, some guy is going to be added to so many spam lists, he gave almost all of his info in a post. [01:02] *** dashcloud has joined #archiveteam-bs [01:04] *** mutoso has quit IRC (Remote host closed the connection) [01:14] *** mistym has joined #archiveteam-bs [01:17] That's the revolutionary. [01:17] The angry one. [01:17] The ultra angry one. [01:22] *** username1 has joined #archiveteam-bs [01:24] *** schbirid2 has quit IRC (Read error: Operation timed out) [01:32] *** primus104 has quit IRC (Leaving.) [01:54] SketchCow: i'm uploading 1990 nytimes.com articles now [02:03] *** godane has quit IRC (Ping timeout: 306 seconds) [02:17] Wonderful [02:19] *** godane has joined #archiveteam-bs [02:33] *** BlueMaxim has joined #archiveteam-bs [03:01] best Chrome extension http://i.imgur.com/h6QTx7H.png [03:02] Snake People is great but little will beat Herp Derp [03:08] herp derp is pretty good, though I currently prefer shutup.css + stylish [03:17] *** mistym has quit IRC (Remote host closed the connection) [03:29] *** Pythia has joined #archiveteam-bs [03:37] *** mistym has joined #archiveteam-bs [04:16] *** SN4T14 has quit IRC (Ping timeout: 369 seconds) [04:35] *** aaaaaaaaa has quit IRC (Leaving) [04:42] *** mistym has quit IRC (Remote host closed the connection) [05:02] *** SN4T14 has joined #archiveteam-bs [05:37] *** mistym has joined #archiveteam-bs [05:40] https://www.youtube.com/watch?v=jaGk2_frk_s [06:28] Someone is trying to "helP" [06:28] By sending in, wow, just every single ROM ever [06:29] Unfortunately, many are ones already in the archive, some darked, some not. [06:29] Lots of duplicates. [06:39] I just wasted time writing a script going "is this in the archive?" [06:39] I guess somewhat useful, but janky [06:42] Surprised there wasn't one already [06:43] Because it can't quite do that. [06:43] Can't quite? [06:43] In fact, I deleted this script. [06:43] It's very loose. It can have endless false positives. [06:43] In this case, someone basically duplicated a collection. [06:45] Ah [06:47] Verified, 100% duplicated. What a waste. [06:48] Hmm. TOSEC could use an update. [06:50] Are you.. [06:50] Are you giving me things to do? [06:50] Don't do that. [06:51] Let's go with "Let's get at least one version of everything up off FOS" [06:51] Then we'll go for the pretty bows [06:51] dev/md0 9.0T 6.1T 3.0T 67% /0 [06:51] *** dashcloud has quit IRC (Read error: Operation timed out) [06:52] *** mistym has quit IRC (Remote host closed the connection) [06:52] I wasn't giving you something to do, I just looked up TOSEC seeing if there was a newer version [06:52] I'd do it myself if my upload wasn't absolute, complete pants [06:53] Someone uploaded Led Zepplin Bootlegs. [06:54] 204 of them. [06:54] Now to get them into archive.org for minimum pain [06:54] i did that one [06:54] is there anything you can't find [06:55] i only found it cause i found a bootleg of robin williams [06:55] after he died [06:58] https://archive.org/details/ledzepbootlegs&tab=about [07:00] *** dashcloud has joined #archiveteam-bs [07:02] These have to go in vaguely more carefully than I usually do. [07:03] But they at least are interesting and rewarding. [07:04] the rar files should have a txt file in them say how there were captured [07:04] Oh sure. [07:30] https://archive.org/details/Led_Zeppelin_-_1969-01-05_Whisky_A_Go-Go_Los_Angeles_CA_Eelgrassflac [07:34] Proof they work [07:41] Because wasting space is my hobby, I'm making a canonical "bootlegs" item with all the files in one place. [07:46] This is going to take a while. I've automated it as much as it can be, that is, mostly, with lots of little nudges. [07:46] Still, it's an interesting set - thanks, Godane [07:46] godane: One of our developers, Hank, wonders why you didn't make the original authors of the EPIC items into the "creator" field instead of EPIC. [07:49] i normally keep the creator for one set/collection the same [07:52] *** mistym has joined #archiveteam-bs [07:56] *** primus104 has joined #archiveteam-bs [07:56] *** mistym has quit IRC (Read error: Operation timed out) [08:00] ok i see another reason [08:00] some of the items have more then one author [08:00] so i can't put them in without get 400 bad request error [08:03] Anyway, I'm going to blast through these bootlegs as fast as I can. [08:03] Lots of them! [08:03] 200 or something. [08:08] some good news [08:08] i can add creators to keywords now [08:10] i'm add the creators into keywords for the 09xxxx items [08:11] i'm not likely to do the older ones cause i don't have the files for them anymore [08:23] I got my act together with the uploading. [08:24] So I can upload a pile of albums [08:26] *** Start has quit IRC (Read error: Connection reset by peer) [08:26] *** Start_ has joined #archiveteam-bs [09:02] any way to bypass a javascript required page with wget? [09:02] i'm trying to grab hackforums.net: http://www.hackforums.net/forumdisplay.php?fid=89 [09:03] but wget gives a file [09:06] *** Muad-Dib has quit IRC (Ping timeout: 252 seconds) [09:07] godane: you'll need to save the cookie [09:07] maybe pretend to be google [09:07] wget -O - -U "Mozilla Firefox" --no-cookies --header "Cookie: sucuri_uidc=0b6224952cc712746672ddbec7e8c258" "http://www.hackforums.net/forumdisplay.php?fid=89" [09:11] Piles of albums are increasing [09:12] garyth: that didn't work [09:12] *** Smiley has quit IRC (Remote host closed the connection) [09:12] also i was using load-cookies cause i have cookies.txt as a text file [09:17] that does work, but you'll need to manually add the sucuri_uidc to that cookies.txt [09:18] that is in the cookies.txt [09:18] *** Smiley has joined #archiveteam-bs [09:20] anyways i'm continuing my grab of nytimes.com [09:22] wget --quiet --save-cookies=cookies.txt "http://www.hackforums.net/forumdisplay.php?fid=89" -O- | sed -nr 's#.*sucuri_uidc=([a-z0-9]+).*#.hackforums.net\tTRUE\t/\tFALSE\t1464168038\tsucuri_uidc\t\1#pg' >> cookies.txt [09:22] wget --quiet --load-cookies=cookies.txt "http://www.hackforums.net/forumdisplay.php?fid=89" -O- [09:22] there's my ugly oneliner ;) [09:24] now it works [09:24] thanks [09:24] no problem [09:28] now some bad news [09:28] its giving me the login screen [09:36] *** Start_ has quit IRC (Read error: Connection reset by peer) [09:37] *** Start has joined #archiveteam-bs [09:39] Great news, I've nailed the Led Zepplin thing pretty well. [09:45] great [09:45] i'm about half way thur nytimes.com 1992 articles [10:01] SketchCow: What happens if copyright holder complains about uploaded material refering to DCMA? [10:02] *referring [10:02] Moon falls out of sky [10:02] literally the moon [10:02] literally the sky [10:02] but seriously? [10:04] Someone has written a multi-threaded Java VM in Javascript [10:07] found: http://www.copyright.gov/1201/docs/1201_recommendation.pdf [10:09] so if a game was designed for Windows ME (which is unsupported) but runs on Windows 8, can i still upload it? [10:15] so looks like maybe able to get esctasy for social anxiety: http://www.reddit.com/r/worldnews/comments/3795r5/ecstasy_may_soon_be_a_treatment_for_social/ [10:16] thats just weird to me though [10:33] *** Muad-Dib has joined #archiveteam-bs [10:58] *** zenguy_pc has quit IRC (Read error: Operation timed out) [11:04] godane: eh!? -- you had any medication before? [11:17] xtc is great tho [12:01] *** zenguy_pc has joined #archiveteam-bs [12:07] i think prozac in liquid form when i was 17 [12:07] that last for 3 to 5months then i stop taking it [12:14] *** BlueMaxim has quit IRC (Quit: Leaving) [12:30] midas: ? [12:32] Smiley: xtc as a drug, i liked it :p [12:33] * Smiley googles [12:33] hugdrug [12:33] oh right [12:38] *** dashcloud has quit IRC (Read error: Operation timed out) [12:50] *** sankin has joined #archiveteam-bs [12:54] *** dashcloud has joined #archiveteam-bs [13:55] *** Start has quit IRC (Disconnected.) [13:57] *** mistym has joined #archiveteam-bs [14:02] *** mistym has quit IRC (Ping timeout: 252 seconds) [14:25] *** zenguy_pc has quit IRC (Ping timeout: 258 seconds) [14:25] *** zenguy_pc has joined #archiveteam-bs [14:41] *** Smiley has quit IRC (Read error: Operation timed out) [14:41] *** Start has joined #archiveteam-bs [14:46] *** ohhdemgir has joined #archiveteam-bs [14:46] *** mistym has joined #archiveteam-bs [15:20] *** primus104 has quit IRC (Leaving.) [15:24] useretail: just upload it anyway, no one will care [15:27] *** Muad-Dib has quit IRC (Ping timeout: 252 seconds) [15:44] so now that Hot Topic owns ThinkGeek this must mean that Linux is cool [15:46] 2015: Year of the Linux T-Shirt [15:49] yipdw: whut [15:49] http://arstechnica.com/business/2015/05/hot-topic-enters-agreement-to-buy-thinkgeek-parent-company-geeknet-inc/ [15:50] *** Muad-Dib has joined #archiveteam-bs [15:50] *** Start has quit IRC (Disconnected.) [15:53] *** mistym has quit IRC (Remote host closed the connection) [15:56] Well I'll be. [15:56] As comments make clear, they laid off for buyout some time ago. [15:57] *** Start has joined #archiveteam-bs [15:58] *** Sellyme has quit IRC (No Ping reply in 180 seconds.) [16:00] bootlegs finished!! [16:01] That was a lot of work but nice. https://archive.org/details/ledzepbootlegs [16:01] *** Start has quit IRC (Client Quit) [16:09] *** mistym has joined #archiveteam-bs [16:16] *** Start has joined #archiveteam-bs [16:19] *** aaaaaaaaa has joined #archiveteam-bs [16:19] *** Sellyme has joined #archiveteam-bs [16:25] DFJustin: This is your Balliwick - people just turned me on to two 300mb+ packs of Japanese MODs [16:27] as in tracker music? didn't know there was a japanese scene for that actually [16:29] other than famitracker type stuff [16:30] Yes, apparently 1. There is, 2. It was nearly lost 3. I found a guy who makes it his area of study [16:30] Crazy, huh [16:30] https://twitter.com/aka_obi [16:30] https://t.co/KhYlmfFywa [16:30] oh I saw this guy he follows me [16:31] for some reason [16:31] I'd love him on our side. [16:31] His depth into Japanese demoscene and modscene could be very good for us adding it [16:32] I uploaded some x68k and fm towns CDs with a bunch of that kind of thing [16:33] but I'm limited to what people have already ripped elsewhere so yeah would be excellent to have someone in-country on it [16:45] *** Start has quit IRC (Disconnected.) [16:55] *** dashcloud has quit IRC (Read error: Connection reset by peer) [17:00] *** godane has quit IRC (Read error: Operation timed out) [17:05] *** dashcloud has joined #archiveteam-bs [17:14] *** username1 is now known as schbirid [17:16] someone please take http://agar.io/ from me :( [17:21] schbirid: why what is this [17:21] why did you link me this again [17:21] I mean not you specifically [17:22] *** dashcloud has quit IRC (Read error: Operation timed out) [17:23] *** primus104 has joined #archiveteam-bs [17:24] xmc: super addicting game [17:24] once you learned it [17:24] oh [17:24] uhhh ok [17:31] *** dashcloud has joined #archiveteam-bs [17:31] and very infuriating [17:31] The obvious solution is to unlearn it by banging your head into the wall. [17:34] On it [17:50] *** Start has joined #archiveteam-bs [17:51] *** dashcloud has quit IRC (Read error: Operation timed out) [17:52] *** primus104 has quit IRC (Leaving.) [17:55] *** dashcloud has joined #archiveteam-bs [18:18] *** Ravenloft has joined #archiveteam-bs [18:20] https://youtu.be/mv87xOccCK8 [18:22] *** godane has joined #archiveteam-bs [18:23] spam? [18:24] no [18:24] just a share [18:34] possibly incomplete archive coverage for http://puu.sh/5RPJq [18:34] blocked by robots.txt, but aside from that, there SHOULD be an image in the archive of "some twitch livestreamer's nipple" [18:34] but there isn't [18:34] so if somebody has puush stuff laying around, maybe check whether that all worked correctly [18:35] idk the exact upload date, so I can't determine reliably whether it should've been caught in the archival effort [18:36] ohai [18:36] joepie91: That puush could not be found. [18:37] *** Start has quit IRC (Read error: Connection reset by peer) [18:59] *** schbirid has quit IRC (Quit: Leaving) [19:07] *** dashcloud has quit IRC (Read error: Operation timed out) [19:11] *** dashcloud has joined #archiveteam-bs [19:11] *** deathy_ is now known as deathy [19:11] joepie91: image-disc says "## ERROR: Unrecognized tracks found on CD! Please report this as a bug." [19:28] *** Start has joined #archiveteam-bs [19:30] *** Start has quit IRC (Client Quit) [19:45] *** ripvanwin has joined #archiveteam-bs [19:50] *** primus104 has joined #archiveteam-bs [19:55] *** Smiley has joined #archiveteam-bs [20:22] ha nice [20:22] http://www.iana.org/domains/root/db/men.html [20:27] hmmmm [20:28] where do i sign up for a .men domain [20:29] Friends have verified notall.men has been taken [20:30] fuckkkk [20:30] by whom though [20:34] so i have uploaded 1995 nytimes.com articles [20:35] i'm working on getting 1996 urls [20:46] *** sankin has quit IRC (Leaving.) [21:40] *** BlueMaxim has joined #archiveteam-bs [21:54] *** Ravenloft has quit IRC (Ping timeout: 265 seconds) [22:00] *** mistym has quit IRC (Remote host closed the connection) [22:06] *** Panasonic has joined #archiveteam-bs [22:16] *** dashcloud has quit IRC (Read error: Operation timed out) [22:17] *** mistym has joined #archiveteam-bs [22:20] *** dashcloud has joined #archiveteam-bs [22:23] *** godane has quit IRC (Ping timeout: 265 seconds) [22:36] *** Panasonic is now known as Ravenloft [23:22] *** godane has joined #archiveteam-bs [23:23] *** Start has joined #archiveteam-bs [23:29] *** Ravenloft has quit IRC (Ping timeout: 370 seconds) [23:31] SketchCow: we really need to find msn install discs [23:31] i think there hard to find then aol cds [23:32] i uploaded my msn disc: https://archive.org/details/cdrom-msn-internet-access-6.0 [23:32] tons of trailers for video games on tehre [23:32] *there