[00:00] *** Jonimus has quit IRC (Ping timeout: 370 seconds) [00:07] *** philpem has quit IRC (Ping timeout: 252 seconds) [00:07] *** logchfoo has quit IRC (Read error: Connection reset by peer) [00:09] *** logchfoo_ starts logging #archiveteam at Mon Jun 15 00:09:07 2015 [00:09] *** logchfoo_ has joined #archiveteam [00:11] *** Jonimus has joined #archiveteam [00:25] does anyone archive 4chan archive sites? [00:25] like warosu, archive.moe, etc [00:25] or have any software relating to it [00:26] *** signius has quit IRC (Ping timeout: 512 seconds) [00:26] So far archivers have asked other archivers to share their dumps, and they have in most part. [00:27] Unfortunately some do not effort to dump whole images, only thumbnails. [00:27] I don't think many dumps are available on Internet Archive, at least I think Install Gentoo archive is [00:27] For what it's worth, 4plebs is the only active /pol/ archiver [00:28] Blade @ Love is Over said to maybe archive /pol/ but that never happened. anounyym1 @ Rebecca Black Tech is mostly uninterested to archive /pol/. [00:28] I mean, Blade went as far as asking 4plebs admin for a dump and I think he received one without full images, but never imported it. [00:28] thats a bit disappointing to me [00:29] *** primus104 has quit IRC (Leaving.) [00:29] I think some archives have been put into ArchiveBot queue before [00:29] its kind of sad to me that more isnt done to archive 4chan, now and in the past, i think its an important part of internet history (or it has been for me, ive been going there for almost ten years now, or a bit less than half my life) [00:30] thats what the archive sites are for, no? [00:30] For what it's worth, I think anounyym1 @ RBT is easiest to work with from my experience. 
[00:30] not all boards get archived and many are only partial it seems [00:31] and the very early posts on 4chan seem to be lost completely [00:31] Yeah, there's some boards with no active archivers I believe [00:31] i dont think anyone archives /b/ [00:32] sunnymilk: There is one [00:32] really? [00:32] fgts.jp [00:32] No images though [00:32] oh sweet, thank you [00:32] ahhh [00:32] heh, imagine the legal issues with running and hosting a /b/ archive [00:32] that's why nobody does it [00:32] yeah :( [00:33] i want to archive 4chan but i dont want to run into legal trouble [00:33] at least at one time one of them did /b/, but only specific posts that users told it to save [00:33] Generally there are limited liability protections in laws for archivers [00:33] kniffy yeah i remember that one [00:33] As long as you act timely on notice to remove illegal content [00:33] i wish 4chan just didnt delete things [00:33] i think its a bit silly [00:33] Also, ArchiveTeam's 4chan page is inaccurate in many places [00:34] http://archiveteam.org/index.php?title=4chan [00:34] theres an attitute among the people that run it that youre not "supposed" to archive 4chan and things are supposed to be ephemeral [00:34] deleting stuff keeps it moving [00:34] * Kazzy notes we're moving into lengthy discussion, suggests #archiveteam-bs [00:34] but why nobody archives, idk [00:35] *** signius has joined #archiveteam [00:45] *** RichardG has quit IRC (Remote host closed the connection) [00:48] *** mistym has quit IRC (Remote host closed the connection) [01:02] *** RichardG has joined #archiveteam [01:07] *** signius has quit IRC (Ping timeout: 512 seconds) [01:16] *** signius has joined #archiveteam [01:16] *** username1 has joined #archiveteam [01:18] *** schbirid has quit IRC (Read error: Operation timed out) [01:18] *** schbirid2 has quit IRC (Read error: Operation timed out) [01:19] *** schbirid has joined #archiveteam [01:19] *** mistym has joined #archiveteam [01:24] *** wp494 has 
quit IRC (LOUD UNNECESSARY QUIT MESSAGES) [01:25] *** wp494 has joined #archiveteam [01:47] Colt Defense (a gun manufacturer) is apparently intending on doing a chapter 11 bankruptcy tomorrow: http://www.thefirearmblog.com/blog/2015/06/14/breaking-news-colt-to-file-chapter-11-bankruptcy/ [01:49] *** signius has quit IRC (Ping timeout: 512 seconds) [01:53] *** RichardG has quit IRC (Read error: Connection reset by peer) [01:54] *** schbirid2 has joined #archiveteam [01:55] *** username1 has quit IRC (Read error: Operation timed out) [01:57] *** schbirid has quit IRC (Ping timeout: 265 seconds) [01:57] *** schbirid has joined #archiveteam [01:58] *** signius has joined #archiveteam [02:04] *** RichardG has joined #archiveteam [02:20] *** sirdancea has quit IRC (Read error: Operation timed out) [02:30] *** signius has quit IRC (Ping timeout: 512 seconds) [02:39] *** signius has joined #archiveteam [02:48] *** marvinw has quit IRC (Read error: Operation timed out) [02:48] *** RichardG_ has joined #archiveteam [02:49] *** Gfy has quit IRC (Read error: Operation timed out) [02:49] *** Gfy has joined #archiveteam [02:49] *** Coderjoe has quit IRC (Read error: Operation timed out) [02:49] *** Coderjoe has joined #archiveteam [02:49] *** balrog has quit IRC (Read error: Operation timed out) [02:49] *** winr4r has quit IRC (Read error: Operation timed out) [02:49] *** aMunster has quit IRC (Read error: Operation timed out) [02:49] *** sep332 has quit IRC (Read error: Operation timed out) [02:49] *** phuzion has quit IRC (Read error: Operation timed out) [02:49] *** nwf has quit IRC (Read error: Operation timed out) [02:49] *** mr_rippit has quit IRC (Write error: Broken pipe) [02:50] *** mistym_ has joined #archiveteam [02:50] *** toad1 has quit IRC (Read error: Operation timed out) [02:51] *** lysobit has quit IRC (Read error: Operation timed out) [02:51] *** achip has quit IRC (Read error: Operation timed out) [02:52] *** balrog has joined #archiveteam [02:52] *** swebb 
sets mode: +o balrog [02:52] *** RichardG has quit IRC (Read error: Operation timed out) [02:53] *** lysobit has joined #archiveteam [02:54] *** mistym has quit IRC (Read error: Operation timed out) [02:54] *** Command-S has quit IRC (Read error: Operation timed out) [02:57] *** bzc6p has quit IRC (Ping timeout: 600 seconds) [02:58] *** vegbrasil has quit IRC (Ping timeout: 600 seconds) [02:59] *** nwf has joined #archiveteam [02:59] *** phuzion has joined #archiveteam [02:59] *** Control-S has joined #archiveteam [02:59] *** xtr-201 has quit IRC (Ping timeout: 370 seconds) [02:59] *** ripvanwin has joined #archiveteam [02:59] *** vegbrasil has joined #archiveteam [02:59] *** achip has joined #archiveteam [03:00] *** sep332 has joined #archiveteam [03:02] *** aMunster has joined #archiveteam [03:02] *** winr4r has joined #archiveteam [03:03] *** marvinw has joined #archiveteam [03:07] *** toad1 has joined #archiveteam [03:11] *** signius has quit IRC (Ping timeout: 512 seconds) [03:12] *** Emcy has quit IRC (Read error: Connection reset by peer) [03:17] *** BlueMaxim has quit IRC (Ping timeout: 512 seconds) [03:17] *** BlueMaxim has joined #archiveteam [03:20] *** signius has joined #archiveteam [03:20] *** RichardG_ has quit IRC (Ping timeout: 370 seconds) [03:30] *** xtr-201 has joined #archiveteam [03:50] *** aaaaaaaaa has quit IRC (Leaving) [03:52] *** signius has quit IRC (Ping timeout: 512 seconds) [03:55] *** foobla232 has joined #archiveteam [03:56] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [03:57] prithee please? [03:57] with cherries on top? [04:01] *** signius has joined #archiveteam [04:06] foobla232: yahoosucks [04:06] lol [04:06] thank you [04:06] is it weird that this channel has 200 people in it and nobody is talking? 
i don't use IRC much [04:07] it's midnight in the US right now [04:08] 5am in the UK [04:09] got woken up by stray cat \o/ [04:09] also, 164 [04:09] foobla232: this channel intentionally doesn't get much traffic, there's topic-specific channels for most things [04:09] 10 PM MST and 9 PST - well that's fair. [04:09] thanks again. good night [04:09] *** foobla232 has quit IRC (Remote host closed the connection) [04:17] *** JesseW has quit IRC (Quit: Leaving.) [04:19] *** JesseW has joined #archiveteam [04:34] *** signius has quit IRC (Ping timeout: 512 seconds) [04:42] *** signius has joined #archiveteam [04:49] *** useretail has quit IRC (Dreaming in digital. Living in real-time. Thinking in binary. Talking in IP.) [04:59] *** Aranje has joined #archiveteam [05:26] We're all whispering to each other. [05:26] About ou. [05:27] Also, the POMF storm is over, FOS is now returning to normal. [05:31] Was http://yahoolabs.tumblr.com/post/89783581601/one-hundred-million-creative-commons-flickr-images ever archived? [05:34] http://archiveteam.org/index.php?title=Picasa expanded [05:38] Added to http://archiveteam.org/index.php?title=Deathwatch#2015 [05:45] *** signius has quit IRC (Ping timeout: 512 seconds) [05:54] *** signius has joined #archiveteam [06:09] *** Atluxity has joined #archiveteam [06:18] *** Command-S has joined #archiveteam [06:18] *** Control-S has quit IRC (Read error: Connection reset by peer) [06:19] zinelibrary.info project status? [06:19] *** RichardG has joined #archiveteam [06:21] *** bsmith096 has joined #archiveteam [06:23] *** mistym_ has quit IRC (Remote host closed the connection) [06:24] *** bsmith096 has quit IRC (Read error: Connection reset by peer) [06:25] *** RichardG has quit IRC (Ping timeout: 255 seconds) [06:26] *** signius has quit IRC (Ping timeout: 512 seconds) [06:27] Anyone have ideas where I might find pre-1993 source code for sudo? 
I'm (out of idle curiosity) trying to research the "sudo lecture", and the current repo only goes back to 1993... [06:28] *** bsmith096 has joined #archiveteam [06:36] *** signius has joined #archiveteam [06:37] *** RichardG has joined #archiveteam [06:37] ive been lurking for a while. sup? [06:38] im the loon grabbing ffnet and ao3 slowly [06:39] 200KBps slow, damn throttling servers [06:45] bsmith096: thanks for grabbing ao3 -- it rather worries me about that not being LOCKSS, although I can understand the concerns. [06:47] *** simpleirc has joined #archiveteam [06:47] *** bsmith096 has quit IRC (Read error: Connection reset by peer) [06:48] jesseW: LOCKSS? [06:50] simpleirc: Lots Of Copies Keep Stuff Save [06:50] *** simpleirc has quit IRC (Read error: Connection reset by peer) [06:50] er, Safe [06:50] not Save [06:52] *** bzc6p has joined #archiveteam [06:52] *** swebb sets mode: +o bzc6p [06:59] *** signius has quit IRC (Ping timeout: 512 seconds) [07:08] *** signius has joined #archiveteam [07:15] *** primus104 has joined #archiveteam [07:17] *** jmc has quit IRC (Read error: Connection reset by peer) [07:17] *** jmc has joined #archiveteam [07:24] *** mistym has joined #archiveteam [07:41] *** mistym has quit IRC (Ping timeout: 483 seconds) [07:48] chfoo yipdw: the warrior default should be changed back to Halo. [07:50] well, I found (at archive.org) the 1985 sudo code: https://archive.org/download/usenet-net/net.sources.mbox.zip -- it doesn't mention the "sudo lecture", though. Must have been added sometime between '85 and '93. [07:53] *** primus104 has quit IRC (Leaving.) 
[07:54] *** JesseW has quit IRC (Ping timeout: 252 seconds) [08:18] *** RichardG has quit IRC (Ping timeout: 240 seconds) [08:33] *** mistym has joined #archiveteam [08:42] *** mistym has quit IRC (Ping timeout: 512 seconds) [09:16] *** bzc6p has quit IRC (Ping timeout: 600 seconds) [09:17] *** bzc6p has joined #archiveteam [09:17] *** swebb sets mode: +o bzc6p [09:17] *** bsmith096 has joined #archiveteam [09:30] *** bsmith096 has quit IRC (Ping timeout: 370 seconds) [09:47] *** Froggypwn has quit IRC (Read error: Connection reset by peer) [09:48] *** Aranje has quit IRC (Ping timeout: 240 seconds) [09:49] *** Froggypwn has joined #archiveteam [09:56] *** sirdancea has joined #archiveteam [10:01] *** MMovie has quit IRC (Quit: Leaving.) [10:02] *** MMovie has joined #archiveteam [10:19] *** lexicon has quit IRC (Read error: Operation timed out) [10:22] *** SadDM_ has quit IRC (Remote host closed the connection) [10:22] *** sirdancea has quit IRC (Read error: Operation timed out) [10:22] *** mistym has joined #archiveteam [10:29] *** mistym has quit IRC (Read error: Operation timed out) [10:38] *** lexicon has joined #archiveteam [10:39] *** SadDM has joined #archiveteam [10:39] *** swebb sets mode: +o SadDM [11:15] *** lexicon has quit IRC (Read error: Operation timed out) [11:19] *** SadDM has quit IRC (Remote host closed the connection) [11:34] *** lexicon has joined #archiveteam [11:35] *** SadDM has joined #archiveteam [11:35] *** swebb sets mode: +o SadDM [11:40] *** lexicon has quit IRC (Read error: Operation timed out) [11:45] *** SadDM has quit IRC (Remote host closed the connection) [11:57] *** lexicon has joined #archiveteam [12:02] *** SadDM has joined #archiveteam [12:02] *** swebb sets mode: +o SadDM [12:12] *** Froggypwn has quit IRC (Ping timeout: 240 seconds) [12:13] *** Froggypwn has joined #archiveteam [12:15] *** RichardG has joined #archiveteam [12:32] *** sankin has joined #archiveteam [13:13] *** zhongfu has quit IRC (Quit: Goodbye.) 
[13:15] *** zhongfu has joined #archiveteam [13:28] *** primus104 has joined #archiveteam [13:45] *** Froggypwn has quit IRC (Ping timeout: 492 seconds) [13:46] *** Froggypwn has joined #archiveteam [14:24] *** vOYtEC has quit IRC (Ping timeout: 512 seconds) [14:27] *** JesseW has joined #archiveteam [14:35] *** vOYtEC has joined #archiveteam [14:37] *** primus104 has quit IRC (Leaving.) [14:38] *** mistym has joined #archiveteam [14:41] JesseW: might be something in here? https://archive.org/details/CDROM_March92 [14:42] *** Aranje has joined #archiveteam [14:45] *** mistym has quit IRC (Remote host closed the connection) [15:01] *** JesseW has quit IRC (Quit: Leaving.) [15:02] *** mistym has joined #archiveteam [15:03] *** Emcy has joined #archiveteam [15:52] *** mistym has quit IRC (Remote host closed the connection) [15:54] *** signius has quit IRC (Remote host closed the connection) [15:56] *** signius has joined #archiveteam [15:59] *** signius has quit IRC (Client Quit) [16:01] *** bzc6p has quit IRC (Read error: Operation timed out) [16:03] *** signius has joined #archiveteam [16:05] *** signius has quit IRC (Remote host closed the connection) [16:09] *** mistym has joined #archiveteam [16:18] *** sirdancea has joined #archiveteam [16:29] *** laxity has joined #archiveteam [16:32] *** Ravenloft has quit IRC (Remote host closed the connection) [16:37] *** signius has joined #archiveteam [16:40] *** signius has quit IRC (Remote host closed the connection) [16:42] *** signius has joined #archiveteam [16:49] *** anon_ has joined #archiveteam [16:52] *** philpem has joined #archiveteam [16:53] *** anon_ has quit IRC (Client Quit) [17:01] *** nertzy has quit IRC (Leaving) [17:02] *** signius has quit IRC (Quit: Leaving) [17:03] *** aaaaaaaaa has joined #archiveteam [17:04] *** swebb sets mode: +o aaaaaaaaa [17:05] *** signius has joined #archiveteam [17:07] *** McGEE has joined #archiveteam [17:12] *** bzc6p has joined #archiveteam [17:12] *** swebb sets mode: +o 
bzc6p [17:14] *** nertzy has joined #archiveteam [17:23] I am assuming POMF is done [17:27] SketchCow: As far as I know, it isn't. [17:27] There are a few "problematic" items and we're waiting for the administrator to give access. [17:28] This is the situation, except if something happened during the few hours while I was cut off (probably not). [17:28] Let's wait for arkiver's announcement. [17:31] *** nox has quit IRC () [17:33] So Instacast's parent company Vemedio has run out of money [17:33] They immediately deleted their website and Twitter account [17:33] *** mistym has quit IRC (Remote host closed the connection) [17:34] instacastcloud.com is still up, but who knows for how long [17:34] gitorious is on track to finish transferring on 21-jun [17:34] SketchCow: pomf is probably done, but I'd like to confirm that with the owner of pomf first [17:35] (if anyone cares anymore :P ) [17:36] *** sirdancea has quit IRC (Read error: Operation timed out) [17:37] instacast items follow the format https://instacastcloud.com/shared/episode/ID, with ID being any string of 1-4 alphanumeric characters (they might go higher, not too sure yet) [17:37] https://instacastcloud.com/b/ID also returns the same content [17:38] mp3s are hosted externally, but probably should be downloaded as well [17:38] all are valid or is the numbering space sparse? [17:38] looks like all are valid [17:40] i'd suggest #instacrap as the irc channel [17:49] how about #latercast [17:49] instacrap sounds too dismissive to me [17:51] xmc: hopefully it goes better towards the end this time. [17:51] latercast will work [17:51] aaaaaaaaa: gitorious? yes, yes it will. i'm dd:ing the disk image over, instead of touching the filesystem [17:54] ah, good thinking.
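The gitorious transfer discussed between 17:51 and 17:56 resumes an interrupted dd copy by giving the same block offset to the reading side (skip) and the writing side (seek). The following is only a rough Python sketch of that idea, not the command actually used: the function name resume_copy and the tiny in-memory demo data are hypothetical, chosen so the example runs without a disk image.

```python
import io


def resume_copy(src, dst, block_size, start_block):
    """Copy src into dst starting at block start_block.

    Mirrors the idea of running dd with skip=N on the input and
    seek=N on the output: both sides jump to the same byte offset,
    so data copied before the interruption is left untouched.
    """
    offset = block_size * start_block
    src.seek(offset)  # like dd skip=N (skip N input blocks)
    dst.seek(offset)  # like dd seek=N (skip N output blocks)
    copied = 0
    while True:
        chunk = src.read(block_size)
        if not chunk:
            break
        dst.write(chunk)
        copied += len(chunk)
    return copied


# Hypothetical demo: a 100-byte "disk image" whose first 40 bytes
# were already transferred before the connection dropped.
source = io.BytesIO(bytes(range(10)) * 10)
partial = io.BytesIO(source.getvalue()[:40])

# Resume at block 5 with 8-byte blocks, i.e. at byte offset 40.
copied = resume_copy(source, partial, block_size=8, start_block=5)
```

The resume only gives a correct image if every block before the offset was fully written, which is why restarting from a block boundary known to be complete is the safer choice.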
[17:55] i had to restart it as the ssh tunnel died a few hundred GB in [17:55] dd:ing sounds like an awesome game title [17:56] but i used the same offset on both sides (skip/seek) so it's fine [17:56] it's uh this delightful command [17:56] ssh -C duncan@ratt.gitorious.c.bitbit.net dd if=/dev/mapper/steelheart.shortcut-gitorious bs=16M skip=79100 | pv -L 5m -s 3892117920k | dd bs=16M seek=79100 of=gitorious.ext4 [18:05] *** mistym has joined #archiveteam [18:08] *** nox has joined #archiveteam [18:09] xmc: I care [18:24] *** primus104 has joined #archiveteam [18:33] *** primus104 has quit IRC (Leaving.) [18:34] *** ripvanwin has quit IRC (Leaving) [18:34] *** ripvanwin has joined #archiveteam [18:56] *** kyan has joined #archiveteam [19:07] *** McGEE has quit IRC (Quit: Connection closed for inactivity) [19:12] *** mistym has quit IRC (Remote host closed the connection) [19:26] *** mistym has joined #archiveteam [19:32] *** primus104 has joined #archiveteam [19:38] SketchCow: can we start a project for instacast on FOS in a bit? [19:39] *** habi has joined #archiveteam [19:39] Instacast has links to audio files on every page, however those audio files are not hosted on instacast but on an external website [19:39] Is it ok if we also save the external mp3's that are not hosted on instacast? (they're big sometimes) [19:40] for example [19:40] https://instacastcloud.com/shared/episode/2ix9 [19:40] has an audio file here: [19:40] http://www.vidadefudido.com.br/podpress_trac/feed/451/0/PODCAST_VDF_13_ETIQUETA.mp3 [19:40] *** habi has left [19:40] Shall we also save the audio files or only everything that is on instacast? 
[19:41] (no external files, so no audio files that means, only pages on instacastcloud.com) [19:41] #latercast [19:45] *** zenguy_pc has quit IRC (Ping timeout: 306 seconds) [19:49] *** zenguy_pc has joined #archiveteam [19:58] *** kyan has quit IRC (Quit: This computer has gone to sleep) [19:58] *** kyan has joined #archiveteam [20:05] *** habi1 has joined #archiveteam [20:06] *** mistym has quit IRC (Remote host closed the connection) [20:09] *** kyan has quit IRC (Quit: Leaving) [20:19] *** habi1 has quit IRC (Ping timeout: 362 seconds) [20:22] *** mistym has joined #archiveteam [20:27] *** useretail has joined #archiveteam [20:41] *** username1 has joined #archiveteam [20:44] *** schbirid has quit IRC (Read error: Operation timed out) [20:44] *** schbirid has joined #archiveteam [20:47] *** schbirid2 has quit IRC (Read error: Operation timed out) [20:54] *** fx_ has quit IRC (Read error: Operation timed out) [20:54] *** fx_ has joined #archiveteam [20:58] *** sankin has quit IRC (Leaving.) [21:00] *** ete_ has joined #archiveteam [21:07] What IS instacast? [21:07] Like, what IS it [21:09] I've just done some reading. [21:09] My question is if instacast hosted podcast files. [21:11] If this is just a nice wrapper to a pile of external podcasts, while I like the idea of saving podcasts and maybe using instacast to do so, I am not convinced this is a value add or that the podcasts are at risk. 
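The URL scheme described at 17:37 (an ID of 1 to 4 alphanumeric characters appended to https://instacastcloud.com/shared/episode/) suggests the item list can simply be enumerated. Below is a minimal sketch of that enumeration. It assumes the IDs use only digits and lowercase letters, which matches the one example in the log (2ix9) but is not confirmed; the function names are hypothetical and the sketch makes no network requests.

```python
from itertools import product
import string

# Assumption: IDs are drawn from digits plus lowercase letters.
# The log only says "alphanumeric"; uppercase may also be valid.
ALPHABET = string.digits + string.ascii_lowercase
BASE_URL = "https://instacastcloud.com/shared/episode/"


def candidate_ids(max_length=4):
    """Yield every ID of length 1..max_length over ALPHABET."""
    for length in range(1, max_length + 1):
        for chars in product(ALPHABET, repeat=length):
            yield "".join(chars)


def candidate_urls(max_length=4):
    """Yield the episode URL for every candidate ID."""
    for item_id in candidate_ids(max_length):
        yield BASE_URL + item_id


def id_space_size(max_length=4):
    """Number of candidate IDs of length 1..max_length."""
    return sum(len(ALPHABET) ** n for n in range(1, max_length + 1))
```

Under the lowercase assumption the ID space holds 1,727,604 candidates for lengths 1 to 4. If uppercase letters turn out to be valid as well, the space grows considerably, so the alphabet is worth checking before splitting the range into work items.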
[21:13] *** sirdancea has joined #archiveteam [21:24] *** username1 has quit IRC (Quit: Leaving) [21:35] *** Rotab has quit IRC (Ping timeout: 198 seconds) [21:39] *** Rotab has joined #archiveteam [21:41] *** BlueMaxim has quit IRC (Read error: Operation timed out) [21:41] *** BlueMaxim has joined #archiveteam [21:52] *** zenguy_pc has quit IRC (Ping timeout: 306 seconds) [21:54] *** schbirid has quit IRC (Remote host closed the connection) [21:59] *** zenguy_pc has joined #archiveteam [22:00] *** Stiletto has quit IRC () [22:10] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [22:11] *** zenguy_pc has joined #archiveteam [22:26] SketchCow: as far as I have seen instacast doesn't host podcast files [22:27] So it's at best a nice directory. [22:27] As far as I know the podcasts are not at risk. Downloading all of them would be a few TB, so it might be better to not grab the podcasts then. [22:28] yes [22:30] Hmmm. [22:30] Well, if it was truly something like 5 TB, it might be worth grabbing mp3 files too [22:30] Just to be sure [22:31] I worry about it being much more. [22:32] *** tephra_ has quit IRC (Read error: Operation timed out) [22:33] It might be more than 5 TB, we can do a partial grab to make a good estimate [22:36] I'll try to make a good estimate [22:44] SketchCow: currently running a scan to get the total size of all podcasts [22:46] *** ete_ has quit IRC (Ping timeout: 492 seconds) [22:47] *** tephra has joined #archiveteam [22:56] *** Start-mob has joined #archiveteam [22:57] *** tephra has quit IRC (Read error: Operation timed out) [23:00] *** Start-mob has quit IRC (Client Quit) [23:00] *** Start-mob has joined #archiveteam [23:02] I definitely believe saving podcasts is a good idea, I just question if this is a haphazard way [23:02] We have a couple "bulkgrabs" of podcasts going back. At some point, it's obvious archive.org will want to make a directory of everything.
[23:02] I have a pile of CD-ROMs I'm adding this year of 2004-era podcasts. [23:03] Oh, my French uploader has given me 25 more CD-ROMs of goodies [23:09] If we want to archive podcasts on a large scale, doing it through this site isn't the best way to do it [23:09] We should make coordinated projects if we want to save all podcasts [23:10] *** tephra has joined #archiveteam [23:10] Not by a website that has a bit of everything [23:10] Also, the podcasts that are going to be grabbed with this website will probably be grabbed again if we start to grab individual podcast hosting websites [23:11] But podcasts linked to from this website might be gone once we start such projects ^ [23:12] So if we don't have plans to do large archiving projects of podcast websites this year, it would be good to save the podcasts linked to from instacast [23:14] SketchCow: the scan of the total size is still running, but as it looks now the total size might be around 12 TB [23:14] Might also go down to 8 or go up to 15 TB [23:39] *** ete_ has joined #archiveteam [23:46] *** primus104 has quit IRC (Leaving.)
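The "partial grab to make a good estimate" idea from 22:33 amounts to extrapolating a total from a sample. The sketch below shows one simple way that could work, assuming the sampled files are representative of the rest. It is not the scan actually run in the channel; the function names are hypothetical and no sizes from the real scan are used.

```python
def estimate_total_bytes(sample_sizes, population):
    """Extrapolate a total size from a sample of file sizes.

    sample_sizes: sizes in bytes of the files measured so far.
    population:   number of files believed to exist in total.
    Assumes the sample is representative; a skewed sample gives
    a skewed estimate.
    """
    if not sample_sizes:
        raise ValueError("need at least one sampled size")
    mean_size = sum(sample_sizes) / len(sample_sizes)
    return mean_size * population


def to_terabytes(num_bytes):
    """Convert bytes to decimal terabytes (1 TB = 10**12 bytes)."""
    return num_bytes / 1e12
```

Because podcast file sizes vary widely, an estimate from a small sample can move a lot as more files are measured, which fits the wide 8 to 15 TB range quoted at 23:14.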