[00:06] JW_work: have you also seen CodeArchive https://the-code-archive.launchrock.com/ ? It's backing up every GitHub repo with 10 stars or more [00:07] yep, I added that to the page already [00:28] Eventually the admins are going to realize I keep swapping between two accounts I have to jam the queue to max for either account [00:28] But not toooooddaaayyyyyyyyyyyyyyyyyyyyy [00:28] swap swap swap boom [00:29] Even though I've not said much, I'm glad we're grabbing as much audio as we are. [00:36] *** Coderjoe_ has joined #archiveteam-bs [00:37] *** DoomTay has joined #archiveteam-bs [00:39] *** Coderjoe has quit IRC (Read error: Operation timed out) [00:44] *** RichardG_ has joined #archiveteam-bs [00:45] *** RichardG has quit IRC (Read error: Connection reset by peer) [00:48] *** JesseW has joined #archiveteam-bs [00:51] *** DoomTay has quit IRC (Ping timeout: 268 seconds) [00:53] *** DoomTay has joined #archiveteam-bs [01:01] *** RichardG_ is now known as RichardG [01:20] *** RichardG_ has joined #archiveteam-bs [01:20] *** RichardG has quit IRC (Read error: Connection reset by peer) [01:28] *** RichardG has joined #archiveteam-bs [01:29] *** RichardG_ has quit IRC (Ping timeout: 255 seconds) [01:31] Sci-hub twitter account is down [01:33] nvm. I was wrong. [01:34] actually, I think I might be right. [01:35] it sometimes loads and sometimes doesn't load https://twitter.com/Sci_Hub [01:35] all of twitter is having issues at the moment [01:36] *** DoomTay has quit IRC (Ping timeout: 268 seconds) [01:36] oh, that's good. well, not good. [01:48] is vk.com like facebook? [01:51] Oh thank fucking god twitter is dead [01:54] vk.com started off as russian rip of facebook, but they have added a bunch of russia specific stuff and taken it in completely different direction [01:56] *** RichardG_ has joined #archiveteam-bs [01:56] *** RichardG has quit IRC (Ping timeout: 370 seconds) [01:57] *** Ravenloft has joined #archiveteam-bs [02:00] is there i should create an account on their? [02:03] *a reason i should [02:10] *** Start_ has joined #archiveteam-bs [02:10] *** Start has quit IRC (Read error: Connection reset by peer) [02:10] if you have russian friends, that's a good reason [02:11] * SketchCow wanders into the street, blinking [02:11] i don't have russian friends :P [02:11] * SketchCow meets others, sans twitter, freed [02:11] CAN YOU [02:11] FEEL A [02:11] BRAND NEW DAY [02:12] https://www.youtube.com/watch?v=SsgO_zQoQdI [02:12] not yet, ask again in 2 hours [02:12] "SketchCow wanders into the street, blinking" sounds like someone who is playing Pokemon GO. [02:13] Down to 479mb of inbox [02:13] (For FTP) [02:15] Actually, it's about to become 0 [02:15] This last one was stupid [02:15] Another example of why it was sticking around. [02:15] It was the High Voltage SID COllection, and this other Atari collection [02:15] not as a series of zips or anything, but as individual files. [02:15] 80,000 [02:16] Murderous. I'm deleting them and deleting the items so far. [02:16] And with that, we've gone from 1.8tb of inbox to 0k of inbox. [02:16] Now I will focus on some other chunks on there that are entirely of my own doing. [02:17] The drive went from 88% to 68% [02:17] 6.2tb of "what the fuck has jason done" [02:17] (Left) [02:30] *** RichardG has joined #archiveteam-bs [02:30] *** RichardG_ has quit IRC (Ping timeout: 255 seconds) [02:42] https://archive.org/details/www.sbs.com.au-news-node-100001-to-109999-odd-numbers-20160815 [02:43] at least www.sbs.com.au will have some sort of full grab [02:43] i'm doing it so i can grab the mp3 urls of SBS World News [02:44] anyways i'm going to bed [02:44] bbl [03:04] *** DoomTay has joined #archiveteam-bs [03:06] SketchCow: Does it feel like cleaning out an old hard drive where people have just put all of their files in random spots? [03:10] hook54321: i don't see how it couldn't [03:12] I've got some hard drives like that at my house. It's so annoying, I can't tell what's taking up so much space. [03:12] I'm tempted to just buy some new hard drives. [03:24] *** achip has joined #archiveteam-bs [03:27] OK, hip hop project now is down to 14gb. It has to stay there - it's trying to run a pile of old torrents, and some of them are incomplete. One gets completed every few days. Very sad, there's 11,000 of them [03:28] A shame how many of the old torrents are dead, dead dead [03:28] This PARTICULAR one is not set up DHT [03:28] So I might set up one that's DHT on another box, and then let it join. [03:28] It'll help with the one that's not joined. [03:28] Sooper Genius [03:29] Thank god, saving the hiphops [03:29] 1,302,944 songs about purple drank [03:31] SketchCow: how big are the torrents? [03:32] Like what [03:33] Like what's the total size of the files? [03:33] Approximately [03:38] hahhaha [03:39] I just watched my two rtorrents have sex with each other [03:39] I think they're smoking a cigarette now [03:41] If the torrent is less than 5 GB try using seedr.cc . I've noticed that some torrents will be done instantly even if they are dead. My hypothesis is that if someone else has downloaded it, it won't need to redownload it. [03:45] *** fie has quit IRC (Read error: Operation timed out) [03:56] "Copyrights prior to 1923 have expired, not including copyrights on sound recordings published prior to February 15, 1972, covered only under state laws." - https://en.wikipedia.org/wiki/List_of_countries%27_copyright_lengths [03:56] SketchCow: that makes a good quote :p [04:03] I finally spent some time on this. [04:03] I am now of the opinion that possibly, some of these mixtapes are kind of lost. [04:04] Damn [04:06] *** Aranje has quit IRC (Quit: Three sheets to the wind) [04:07] 11,391 mixtapes I am now going through the torrents of. [04:07] So then, the next thing to look for, is if someone has a share or a hard drive or something. [04:23] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:26] *** BlueMaxim has joined #archiveteam-bs [04:28] *** tomwsmf has quit IRC (Ping timeout: 255 seconds) [04:29] *** Sk1d has joined #archiveteam-bs [05:17] *** superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye) [05:27] *** superkuh has joined #archiveteam-bs [05:35] by the way, this is very useful. http://www.chrisains.com/seo-tools/extract-urls-from-web-serps/ [06:00] *** RichardG has quit IRC (Ping timeout: 244 seconds) [06:01] If an SEO is blogging about it, it's 109 years old and we already do it [06:04] specifically, I think achip has scripts to do that, but I don't know if he has made them available [06:34] JesseW: like, basically the same thingas that, or more automated/thorough? [06:35] yep [06:37] yep to both? [06:38] *** DoomTay has quit IRC (Quit: Page closed) [06:39] Found a 1.8tb download I did [06:39] That's gonna sting [06:41] "found"? [06:41] what was it? [06:43] Well, I'm going through my drive [06:43] finding projects [06:43] old software and OS's [06:43] 1.8tb of old software and OS's? that certainly is ... quite a bit [06:44] Who messes around. Not me. [06:44] 17gb collection of 1980s Macintosh Software [06:55] *** kristian_ has joined #archiveteam-bs [06:58] I think I'm in love. [07:00] *** JesseW has quit IRC (Ping timeout: 370 seconds) [07:01] congrats [07:01] https://archive.org/details/2014_VETUSWARE_Mirror [07:06] you guys know a way I can archive this fourm https://www.rossmanngroup.com/boards/ [07:07] its a paywall fourm [07:07] that I have a subscription to [07:08] SketchCow: I’m still sitting on the waffleimages dump. Is the reconstructed WARC (1 of 256 directories) I uploaded suitable for inclusion into the wayback machine? https://archive.org/download/img-waffleimages-com [07:11] I'll fling it in [07:15] Sounds like yes. I’ll WARC and upload the remaining images then. [07:32] https://archive.org/details/Extremely_Large_Early_Macintosh_Software_Collection [07:34] archive.org/details/Old_PC_Drivers_Collection [07:48] Putting another call out. We really could do with a few more newsbuddy grabbers. If anyone has a fast, stable connection and is willing to help, just come into #newsgrabber and let myself or arkiver know please [07:53] My high school has a really fast internet connection, but it uses OpenDNS unfortunately. [07:54] Would your School be fine with a crawler running? [07:54] And can you open ports in the firewall? [08:03] AmigaOS Apple Rhapsody Clonezilla IBM OS2 MPM NextSTEP QNX Solaris [08:03] Apple Darwin Apple UNIX DOS IBM PS MS Windows Novell Netware Ramfoos2 Xenix [08:03] Apple Lisa BeOS FreeBSD IS_DOS Mikrotik_RouterOS OS DOS Raspberry Pi source [08:03] Apple Mac OS CPM Gparted Kolibri Mikrotik_SwOS PcBSD ReactOS [08:03] Apple MacWorks Cisco HirensBootCD Linux NAS4Free PlamOS Reanimator [08:03] Wheeeeeeeeeeeeeeeee [08:05] du -sh . [08:05] 771G . [08:05] We call it "Let's make a 771gb Item" [08:07] *** kristian_ has quit IRC (Remote host closed the connection) [08:38] *** altlabel_ is now known as altlabel [08:40] *** Honno has joined #archiveteam-bs [09:14] *** schbirid has joined #archiveteam-bs [10:11] *** RichardG has joined #archiveteam-bs [10:22] *** RichardG has quit IRC (Ping timeout: 370 seconds) [11:09] *** RichardG has joined #archiveteam-bs [12:24] *** Ravenloft has quit IRC (Read error: Connection reset by peer) [12:41] Has anyone been archiving pastebins? I've got a few things I'd like to search for. [12:54] *** BlueMaxim has quit IRC (Quit: Leaving) [12:57] ravetcofx: save your cookies from your browser somehow, then use grab-site with a wpull arg for load-cookies or such [12:57] ravetcofx: ofc that means it will contain your cookie [12:57] (the archive) [13:09] paste bins would need to be continuously scanned [13:10] I rarely let my pastes live more than a week [13:12] and usually private [13:14] great, the forum.openstreetmap.org wget segfaulted [13:15] log says "FINISHED --2016-08-16 00:24:36--" though [13:15] wat [13:28] seems about right [13:54] *** brayden_ has joined #archiveteam-bs [13:54] *** swebb sets mode: +o brayden_ [13:59] *** brayden has quit IRC (Read error: Operation timed out) [15:53] *** DoomTay has joined #archiveteam-bs [16:05] Now that I think about it, it's kinda hilarious that Jason somehow concluded Twitter was "dead" just because the site was having problems [16:50] *** Start_ is now known as Start [17:18] Sigh. [17:18] *** SketchCow sets mode: +b *!*webchat@*.res.bhn.net [17:18] *** DoomTay was kicked by SketchCow (DoomTay) [17:18] It was a joke. [17:18] Hey [17:18] * SketchCow waves hand [17:18] Why can't you hear me [17:35] *** schbirid has quit IRC (Quit: Leaving) [17:36] Anyway. [17:36] Tumblr discovered the MS-DOS emulation on Internet Archive. The day is quite frisky, traffic wise. [17:45] SketchCow: oh, have a link? (to the tumblr things) [17:53] thanks joepie91 [17:57] *** tomwsmf has joined #archiveteam-bs [17:57] It's all over the place. [17:57] Nothing informative. [17:58] http://imgur.com/gallery/Uo6svOV is the imgur "discussion" [18:03] How much traffic can IA handle? [18:04] they're currently pushing out about 30 gigabits https://monitor.archive.org/weathermap/weathermap.html [18:05] *** REiN^ has quit IRC (Read error: Operation timed out) [18:08] hook54321: specifically, the monitor you want to look at for IA outbound traffic: https://monitor.archive.org/cacti/graph.php?action=view&local_graph_id=6467&rra_id=all [18:10] holy cow. how much bandwidth does that require? [18:19] hook54321: currently, 29.52 gigabits per second, or thereabouts. [18:32] *** barblefis has joined #archiveteam-bs [18:55] *** REiN^ has joined #archiveteam-bs [18:58] *** mutoso has quit IRC (Ping timeout: 250 seconds) [19:04] *** barblefis has left [19:06] *** DoomTay has joined #archiveteam-bs [19:08] *** kristian_ has joined #archiveteam-bs [19:12] *** mutoso has joined #archiveteam-bs [19:30] *** DoomTay has quit IRC (DoomTay) [19:35] Anyone uploaded the Shadow Broker exploits yet? [19:37] *** DoomTay has joined #archiveteam-bs [19:40] *** kristian_ has quit IRC (Leaving) [19:57] *** JW_work1 has joined #archiveteam-bs [20:00] *** JW_work has quit IRC (Read error: Operation timed out) [20:28] *** Stiletto has quit IRC (Read error: Connection reset by peer) [20:29] *** Stiletto has joined #archiveteam-bs [20:34] *** kristian_ has joined #archiveteam-bs [20:35] *** JW_work1 has quit IRC (Quit: Leaving.) [20:37] *** JW_work has joined #archiveteam-bs [20:49] *** JW_work has quit IRC (Quit: Leaving.) [20:49] *** JW_work has joined #archiveteam-bs [20:57] *** DoomTay has quit IRC (DoomTay) [21:18] *** JW_work has quit IRC (Quit: Leaving.) [21:18] *** JW_work has joined #archiveteam-bs [21:19] *** zenguy has quit IRC (Ping timeout: 246 seconds) [21:22] *** DoomTay has joined #archiveteam-bs [21:22] *** JW_work has quit IRC (Read error: Connection reset by peer) [21:22] *** JW_work has joined #archiveteam-bs [21:34] 3,750 PDFs of Manuals being added [21:44] NICE [21:48] https://www.minnpost.com/business/2016/08/rise-and-fall-gopher-protocol [22:03] *** SmileyG has joined #archiveteam-bs [22:03] *** Smiley has quit IRC (Read error: Connection reset by peer) [22:09] *** DoomTay has quit IRC (DoomTay) [22:27] *** zenguy has joined #archiveteam-bs [22:27] So, as expected.... piles and piles and piles of takedown notices on the stuff I uploaded. [22:28] thanks for the link joepie91 [22:28] lol SketchCow what are they manuals for? [23:58] SketchCow: takedown noticed on the manuals? [23:58] *** Honno has quit IRC (Read error: Operation timed out)