[00:05] *** dtm has quit IRC (Read error: Operation timed out) [00:11] *** ete_ has quit IRC (Read error: Connection reset by peer) [00:11] *** cechk01 has quit IRC (Read error: Connection reset by peer) [00:12] *** dtm has joined #archiveteam [00:59] *** sep332 has quit IRC (Read error: Operation timed out) [01:00] *** sep332 has joined #archiveteam [01:02] *** bwn has quit IRC (Read error: Connection reset by peer) [01:03] *** bwn has joined #archiveteam [01:05] tree3: Any luck with downloading the videos by chance? [01:10] phuzion: For what it's worth, I've gotten most of them [01:10] kyan: How many did you grab? [01:11] not sure yet [01:11] Thing is, some of them did'nt download fully [01:11] (left .part files in the directory) [01:11] What are you using to download? youtube-dl? [01:11] yup [01:11] Getting around the throttle somehow? I'm getting 500KB/s max [01:11] and some of them weren't available due to takedown requests and stuff [01:12] Yeah, youtube-dl does a rate-bypass thng [01:12] I got between 5 and 35 mbps [01:12] You can see what I got at https://archive.org/search.php?query=subject%3A%22WARCdealer%20pack%22%20AND%20subject%3A%22rochu%22 [01:13] not all of tem are uploaded yet though [01:13] gotcha [01:27] *** ParkerR has quit IRC (Remote host closed the connection) [01:42] *** philpem has quit IRC (Ping timeout: 252 seconds) [01:43] *** bwn_ has joined #archiveteam [01:46] *** xXx_ndidd has joined #archiveteam [01:47] *** K4k_ has quit IRC (Read error: Operation timed out) [01:47] *** Ghost_of_ has quit IRC (Remote host closed the connection) [01:47] *** K4k_ has joined #archiveteam [01:49] *** bwn has quit IRC (Read error: Operation timed out) [01:53] *** ndiddy has quit IRC (Read error: Operation timed out) [01:58] phuzion: You can see the ones that I haven't gotten yet at http://paste.ubuntu.com/13913492/ [01:58] search for ".part" [01:59] *** JesseW has joined #archiveteam [02:26] *** Start has joined #archiveteam [02:42] *** ParkerR has joined #archiveteam [03:10] *** cechk01 has joined #archiveteam [03:17] *** RichardG has quit IRC (Ping timeout: 252 seconds) [03:25] *** Ungstein1 has quit IRC (Quit: Leaving.) [03:38] *** Start has quit IRC (Quit: Disconnected.) [03:40] *** bwn_ has quit IRC (Read error: Operation timed out) [03:40] *** Start has joined #archiveteam [03:48] *** WinterFox has joined #archiveteam [03:52] *** balrog has quit IRC (Bye) [03:54] *** balrog has joined #archiveteam [03:54] *** swebb sets mode: +o balrog [03:54] *** JetBalsa has joined #archiveteam [03:55] I have a interesting question, I want to run a warrior but not in a VM but on shell on a existing system [03:55] whats the current codebase at, I found seesaw kit and warrior and I'm confused on a current warrior in use. [03:59] running the warrior outside of a VM is sort of discouraged, because the consistency provided by the image is no longer there- usually you would just run the script for whatever project you are interested in [04:03] each project has general instructions for running without a warrior, as well as distribution specific instructions [04:04] Yeah, it's totally doable. Lots of us do it. It's just something that requires a little bit more knowledge. [04:05] Getting people to run the warrior is easy because it's "install virtualbox, download this file, and do file > import > click every next button you see, right click the VM and start it" [04:08] I kinda like the set and forget aspect of the auto side of things :3 [04:08] Which is what the warrior vm is great for. [04:09] Ya, [04:09] I wonder if I can cross load into qemu, trying now :3 [04:09] We should either continue this conversation in #archiveteam-bs or #warrior [04:10] ill move warrior [04:37] https://archive.org/details/macaddict&tab=collection [05:04] *** JetBalsa has quit IRC (Quit: Page closed) [05:05] SketchCow: cool [05:06] i'm grabbing issue one of macaddict right now [05:06] is wired magazine going to be put up too? [05:07] *** aaaaaaaaa has quit IRC (Leaving) [05:11] *** indrora has joined #archiveteam [05:12] Maybe. [05:13] *** indrora has left [05:14] *** indrora has joined #archiveteam [05:15] So, a site that I vaguely believe should be archived is ska-dead due to problems. Was wondering if there's anything I can do to make archive team and FurNation (think pre-geocities for furries, started ~1996) get together? [05:31] indrora: What's the URL? About how many pages is it? [05:33] JesseW, http://furnation.com/ and probably somewhere in the order of 10ish TB -- 20 years of furries. [05:33] What I know from the twitter is that their primary servers had 128GB of RAM and several 2TB disks [05:36] *** voltagex has quit IRC (Quit: WeeChat 1.3) [05:37] *** JetBalsa has joined #archiveteam [05:40] I have no idea the actual /scale/. I know that if the maintainer can be contacted, it'd be relatively simple to do what was done with pomf.se [05:40] What did you mean by "ska-dead"? (not a term I've heard) [05:43] *** nertzy has joined #archiveteam [05:45] It's dead. So dead if it were any deadder it'd be pushing up daisies. [05:45] There's no read-only version. There's no access other than broken archive.org content. [05:48] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [05:49] ah. [05:51] *** xXx_ndidd has quit IRC (Read error: Connection reset by peer) [05:54] Much of the site was powered by a lot of custom PHP. [05:56] Do you have any means of contacting the admin? [05:58] *** Sk1d has joined #archiveteam [05:58] The most I know is via Twitter ( @furnation ) -- I've idly mentioned textfiles and Archive Team, but I'm personally not aware of any direct way to contact them. [06:06] I'll see what I can do to get in contact with them. [06:17] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [06:19] Yes, point them at textfiles/SketchCow/Jason Scott. [06:21] Will do. [06:27] *** VonGuard has quit IRC (Read error: Connection reset by peer) [06:27] *** VonGuard has joined #archiveteam [07:13] Are there any plans to archive old reddit posts? [07:14] JetBalsa: https://archive.org/details/2015_reddit_comments_corpus [07:15] Approximately 350,000 comments out of ~1.65 billion were unavailable [07:16] Also, I was thinking of entire threads in context [07:18] *** remsen2 has joined #archiveteam [07:18] JetBalsa: the dataset there doesn't have a 'parent' for the comments? [07:18] looks like thats a smaller dataset [07:18] others exists: https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/ [07:18] *** R5M has joined #archiveteam [07:19] * ivan` sees a 'parent' on "Example JSON Block" [07:19] its there, I see that now [07:19] the page I linked links to that :/ [07:19] GG [07:19] *** remsen has quit IRC (Read error: Operation timed out) [07:20] I read the 300k out of sentence backwards, sorrya about that [07:20] *** MMovie1 has quit IRC (Read error: Connection reset by peer) [07:22] *** MMovie has joined #archiveteam [07:22] you could theoretically bruteforce the entire reddit object ID space [07:23] Given that basically everything in reddit is a K:V [07:23] *** remsen2 has quit IRC (Read error: Operation timed out) [07:24] *** R5M has quit IRC (Leaving) [07:28] SketchCow: i'm looking at the mac addict magazines [07:28] and i think most of covers have to be rescan [07:28] other then that there very good [07:29] i only say that cause they looked a bit cut off on the left with alot of the covers [07:51] *** bwn_ has joined #archiveteam [07:57] *** tree3 has quit IRC (Read error: Operation timed out) [07:57] *** JesseW has quit IRC (Leaving.) [08:07] *** remsen has joined #archiveteam [08:13] *** WinterFox has quit IRC (Read error: Operation timed out) [08:16] *** WinterFox has joined #archiveteam [08:45] *** midas1 is now known as midas [08:45] yes! [08:59] *** JetBalsa has quit IRC (Read error: Connection reset by peer) [09:29] *** cadbury has quit IRC (Read error: Operation timed out) [09:36] *** schbirid has joined #archiveteam [09:46] *** BlueMaxim has quit IRC (Quit: Leaving) [10:22] *** wutno has joined #archiveteam [10:24] *** WapCapLet has quit IRC (Read error: Operation timed out) [10:36] *** blergh- has quit IRC (Remote host closed the connection) [11:16] *** vitzli has joined #archiveteam [12:07] xmc: Did the regex for gitorious work? [12:58] *** dashcloud has quit IRC (Read error: Operation timed out) [13:01] *** dashcloud has joined #archiveteam [13:14] *** Billy_ has joined #archiveteam [13:14] *** Billy__ has joined #archiveteam [13:15] * Billy__ slaps dashcloud around a bit with a large fishbot [13:18] *** Billy_ has quit IRC (Ping timeout: 240 seconds) [13:19] *** Billy__ has quit IRC (Ping timeout: 240 seconds) [13:26] *** RichardG has joined #archiveteam [13:28] *** REiN^ has joined #archiveteam [13:36] *** melody has joined #archiveteam [13:45] *** philpem has joined #archiveteam [14:18] *** WinterFox has quit IRC (Read error: Operation timed out) [14:20] *** WinterFox has joined #archiveteam [14:36] *** Rickster has joined #archiveteam [14:40] *** Stiletto has quit IRC () [14:42] *** nertzy has joined #archiveteam [14:59] *** Stiletto has joined #archiveteam [15:02] Anyone know if the BYTE magazine archive on archive.org is available in a zip or tar format somewhere? I'd like the whole collection but I don't want to have to get every issue seperately. [15:05] K4k_: I don't know if there's a separate item for the whole thing, but you can use their API to get a whole collection: https://emerging.commons.gc.cuny.edu/2014/03/downloading-items-internet-archive-collection-using-python/ [15:05] *** Vito`__ is now known as Vito` [15:26] *** bauruine has quit IRC (Ping timeout: 252 seconds) [15:27] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [15:56] *** andrewf has joined #archiveteam [15:56] *** tree3 has joined #archiveteam [16:03] *** andrewf has quit IRC (Quit: Page closed) [16:33] *** bauruine has joined #archiveteam [16:44] *** remsen2 has joined #archiveteam [16:44] *** remsen2 has quit IRC (Remote host closed the connection) [16:49] *** remsen has quit IRC (Read error: Operation timed out) [16:52] *** Woflie has joined #archiveteam [16:53] o7 http://furnation.com/ o7 *Bugles out some taps, disappears* [16:53] *** Woflie has quit IRC (Client Quit) [16:54] *** remsen has joined #archiveteam [17:17] would anyone care to help me find (if possible) the files for https://web.archive.org/web/20110302231052/http://wano.blip.tv/posts?view=archive&nsfw=dc [17:18] *** VADemon has joined #archiveteam [17:18] I read somewhere that all of blip.tv was archived somehow [17:18] but I can't seem to find them [17:18] I'll have a look at it [17:19] thanks [17:19] as far as I remember blip.tv also has an option for "upload this file to archive.org as well?" and I think that option was used [17:22] so it looks like the IDs of the videos of https://web.archive.org/web/20110302231052/http://wano.blip.tv/posts?view=archive&nsfw=dc are not the same as on the normal blip.tv site [17:22] We did not archive wano.blip.tv, we only archived blip.tv [17:22] But if the videos from wano are also on blip.tv, they should be saved [17:23] I think they used to be blip.tv/wano [17:23] or simular [17:26] I can't find the videos. The IDs are not the same as on blip.tv. [17:26] It's from 2011 though [17:26] It's very possible that these were already gone when we started archiving [17:29] right [17:29] ok, thanks for trying [17:42] *** vitzli has quit IRC (Leaving) [17:51] *** remsen2 has joined #archiveteam [17:52] *** remsen2 has quit IRC (Remote host closed the connection) [17:57] *** remsen has quit IRC (Read error: Operation timed out) [18:07] *** remsen has joined #archiveteam [18:09] *** remsen has quit IRC (Client Quit) [18:10] *** remsen has joined #archiveteam [18:12] Vito`: Thanks, I didn't know they had an API for that kind of operation. I will look in to it! [18:14] PurpleSym: gitorious is ready but i have to do some sysadmining on the backend still [18:19] *** nertzy has joined #archiveteam [18:19] Alright. [18:47] SketchCow: I'm not sure how much free space FOS has right now, but a lot of new FTP data is currently coming in [18:54] I am aware. We're holding out (under 50% use) but that 50% use is one day's downloads, so that's pretty involved. [19:03] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [19:11] *** bwn_ has quit IRC (Read error: Operation timed out) [19:25] *** arkiver2 has joined #archiveteam [19:25] *** aaaaaaaaa has joined #archiveteam [19:25] *** swebb sets mode: +o aaaaaaaaa [19:28] *** xmc has quit IRC (Quit: brb rebooting) [19:29] *** cadbury has joined #archiveteam [19:32] *** WHO HERE IS DKL3 ON ARCHVE [19:35] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [19:36] I see DKL3 uploaded a lot of WARCs [19:36] what's the problem with them? [19:38] https://trello.com/dkl31 from Bibliotheca Anonoma [19:40] *** bwn has joined #archiveteam [19:42] *** xmc has joined #archiveteam [19:42] *** swebb sets mode: +o xmc [19:44] antonizoo might know more about who dkl3 is [19:49] No, no. [19:49] The upshot is they can upload WARCs but they're not going into Wayback. [19:53] *** arkiver2 has joined #archiveteam [20:00] *** BlueMaxim has joined #archiveteam [20:00] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [20:04] *** arkiver2 has joined #archiveteam [20:24] *** ndiddy has joined #archiveteam [20:25] *** K4k_ has quit IRC (Quit: WeeChat 1.3) [20:33] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [21:01] *** JetBalsa has joined #archiveteam [21:01] Whats the current status of Yuko project, I have not gotten any new items in 24hr [21:07] Yuku* [21:08] JetBalsa: Doesn't appear to be giving out new items at this time for one reason or another. [21:24] Term1T3rm1n@l1! [21:24] *** RichardG has quit IRC (Read error: Connection reset by peer) [21:27] *** RichardG has joined #archiveteam [21:30] *** arkiver2 has joined #archiveteam [21:57] *** WinterFox has quit IRC (Read error: Operation timed out) [22:00] *** WinterFox has joined #archiveteam [22:06] *** Ghost_of_ has joined #archiveteam [22:19] *** arkiver2 has quit IRC (Quit: Nettalk6 - www.ntalk.de) [22:26] JetBalsa: you may want to change that password now that it's out in public [22:27] Its a temp that I give out to nerds, Its ment to be changed, but good thing its not on any public systems [22:27] GG, Damn you keypass [22:27] just wanted to make sure you knew that it was pasted here [22:27] Ya [22:28] *** melody has quit IRC (Read error: Operation timed out) [22:28] Hello. I noticed that I was mentioned here. [22:29] Regarding, specifically, DKL3's WARCs. [22:30] *** melody has joined #archiveteam [22:31] I can answer any questions about it, because we have wanted to contact the Internet Archive directly regarding the upload of these WARCs for a while [22:33] Wherever you'd like to ask please ping me [22:33] Or pm [22:50] ------------------------------------------ [22:50] The Google Code project has started! [22:50] Join #googlecodeblue [22:50] ------------------------------------------ [22:50] oh yeah [23:01] *** Ghost_of_ has quit IRC (Quit: Leaving) [23:01] *** Ghost_of_ has joined #archiveteam [23:49] *** dashcloud has quit IRC (Read error: Connection reset by peer) [23:49] *** dashcloud has joined #archiveteam