#archiveteam 2015-12-11,Fri


Time Nickname Message
00:05 🔗 dtm has quit IRC (Read error: Operation timed out)
00:11 🔗 ete_ has quit IRC (Read error: Connection reset by peer)
00:11 🔗 cechk01 has quit IRC (Read error: Connection reset by peer)
00:12 🔗 dtm has joined #archiveteam
00:59 🔗 sep332 has quit IRC (Read error: Operation timed out)
01:00 🔗 sep332 has joined #archiveteam
01:02 🔗 bwn has quit IRC (Read error: Connection reset by peer)
01:03 🔗 bwn has joined #archiveteam
01:05 🔗 phuzion tree3: Any luck with downloading the videos by chance?
01:10 🔗 kyan phuzion: For what it's worth, I've gotten most of them
01:10 🔗 phuzion kyan: How many did you grab?
01:11 🔗 kyan not sure yet
01:11 🔗 kyan Thing is, some of them didn't download fully
01:11 🔗 kyan (left .part files in the directory)
01:11 🔗 phuzion What are you using to download? youtube-dl?
01:11 🔗 kyan yup
01:11 🔗 phuzion Getting around the throttle somehow? I'm getting 500KB/s max
01:11 🔗 kyan and some of them weren't available due to takedown requests and stuff
01:12 🔗 kyan Yeah, youtube-dl does a rate-bypass thing
01:12 🔗 kyan I got between 5 and 35 mbps
01:12 🔗 kyan You can see what I got at https://archive.org/search.php?query=subject%3A%22WARCdealer%20pack%22%20AND%20subject%3A%22rochu%22
01:13 🔗 kyan not all of them are uploaded yet though
01:13 🔗 phuzion gotcha
01:27 🔗 ParkerR has quit IRC (Remote host closed the connection)
01:42 🔗 philpem has quit IRC (Ping timeout: 252 seconds)
01:43 🔗 bwn_ has joined #archiveteam
01:46 🔗 xXx_ndidd has joined #archiveteam
01:47 🔗 K4k_ has quit IRC (Read error: Operation timed out)
01:47 🔗 Ghost_of_ has quit IRC (Remote host closed the connection)
01:47 🔗 K4k_ has joined #archiveteam
01:49 🔗 bwn has quit IRC (Read error: Operation timed out)
01:53 🔗 ndiddy has quit IRC (Read error: Operation timed out)
01:58 🔗 kyan phuzion: You can see the ones that I haven't gotten yet at http://paste.ubuntu.com/13913492/
01:58 🔗 kyan search for ".part"
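The check kyan describes (spotting leftover ".part" files from interrupted youtube-dl downloads) can be sketched locally as below. This is a minimal sketch, not what kyan actually ran; the function name `find_partial_downloads` is hypothetical, and it assumes the incomplete files carry a `.part` suffix as mentioned in the log.

```python
from pathlib import Path


def find_partial_downloads(directory):
    """Return a sorted list of files ending in .part under `directory`.

    Searches recursively; the name and behaviour are illustrative only.
    """
    return sorted(p for p in Path(directory).rglob("*.part") if p.is_file())
```

Calling `find_partial_downloads(".")` in the download directory would list the videos that still need to be resumed or re-fetched.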
01:59 🔗 JesseW has joined #archiveteam
02:26 🔗 Start has joined #archiveteam
02:42 🔗 ParkerR has joined #archiveteam
03:10 🔗 cechk01 has joined #archiveteam
03:17 🔗 RichardG has quit IRC (Ping timeout: 252 seconds)
03:25 🔗 Ungstein1 has quit IRC (Quit: Leaving.)
03:38 🔗 Start has quit IRC (Quit: Disconnected.)
03:40 🔗 bwn_ has quit IRC (Read error: Operation timed out)
03:40 🔗 Start has joined #archiveteam
03:48 🔗 WinterFox has joined #archiveteam
03:52 🔗 balrog has quit IRC (Bye)
03:54 🔗 balrog has joined #archiveteam
03:54 🔗 swebb sets mode: +o balrog
03:54 🔗 JetBalsa has joined #archiveteam
03:55 🔗 JetBalsa I have an interesting question: I want to run a warrior, not in a VM but in a shell on an existing system
03:55 🔗 JetBalsa where's the current codebase at? I found seesaw kit and warrior, and I'm confused about which warrior is currently in use.
03:59 🔗 dashcloud running the warrior outside of a VM is sort of discouraged, because the consistency provided by the image is no longer there; usually you would just run the script for whatever project you are interested in
04:03 🔗 aaaaaaaaa each project has general instructions for running without a warrior, as well as distribution specific instructions
04:04 🔗 phuzion Yeah, it's totally doable. Lots of us do it. It's just something that requires a little bit more knowledge.
04:05 🔗 phuzion Getting people to run the warrior is easy because it's "install virtualbox, download this file, and do file > import > click every next button you see, right click the VM and start it"
04:08 🔗 JetBalsa I kinda like the set and forget aspect of the auto side of things :3
04:08 🔗 phuzion Which is what the warrior vm is great for.
04:09 🔗 JetBalsa Ya,
04:09 🔗 JetBalsa I wonder if I can cross load into qemu, trying now :3
04:09 🔗 phuzion We should either continue this conversation in #archiveteam-bs or #warrior
04:10 🔗 JetBalsa I'll move to #warrior
04:37 🔗 SketchCow https://archive.org/details/macaddict&tab=collection
05:04 🔗 JetBalsa has quit IRC (Quit: Page closed)
05:05 🔗 godane SketchCow: cool
05:06 🔗 godane i'm grabbing issue one of macaddict right now
05:06 🔗 godane is wired magazine going to be put up too?
05:07 🔗 aaaaaaaaa has quit IRC (Leaving)
05:11 🔗 indrora has joined #archiveteam
05:12 🔗 SketchCow Maybe.
05:13 🔗 indrora has left
05:14 🔗 indrora has joined #archiveteam
05:15 🔗 indrora So, a site that I vaguely believe should be archived is ska-dead due to problems. Was wondering if there's anything I can do to make archive team and FurNation (think pre-geocities for furries, started ~1996) get together?
05:31 🔗 JesseW indrora: What's the URL? About how many pages is it?
05:33 🔗 indrora JesseW, http://furnation.com/ and probably somewhere on the order of 10ish TB -- 20 years of furries.
05:33 🔗 indrora What I know from the twitter is that their primary servers had 128GB of RAM and several 2TB disks
05:36 🔗 voltagex has quit IRC (Quit: WeeChat 1.3)
05:37 🔗 JetBalsa has joined #archiveteam
05:40 🔗 indrora I have no idea the actual /scale/. I know that if the maintainer can be contacted, it'd be relatively simple to do what was done with pomf.se
05:40 🔗 JesseW What did you mean by "ska-dead"? (not a term I've heard)
05:43 🔗 nertzy has joined #archiveteam
05:45 🔗 indrora It's dead. So dead if it were any deadder it'd be pushing up daisies.
05:45 🔗 indrora There's no read-only version. There's no access other than broken archive.org content.
05:48 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
05:49 🔗 JesseW ah.
05:51 🔗 xXx_ndidd has quit IRC (Read error: Connection reset by peer)
05:54 🔗 indrora Much of the site was powered by a lot of custom PHP.
05:56 🔗 JesseW Do you have any means of contacting the admin?
05:58 🔗 Sk1d has joined #archiveteam
05:58 🔗 indrora The most I know is via Twitter ( @furnation ) -- I've idly mentioned textfiles and Archive Team, but I'm personally not aware of any direct way to contact them.
06:06 🔗 indrora I'll see what I can do to get in contact with them.
06:17 🔗 nertzy has quit IRC (Quit: This computer has gone to sleep)
06:19 🔗 JesseW Yes, point them at textfiles/SketchCow/Jason Scott.
06:21 🔗 indrora Will do.
06:27 🔗 VonGuard has quit IRC (Read error: Connection reset by peer)
06:27 🔗 VonGuard has joined #archiveteam
07:13 🔗 JetBalsa Are there any plans to archive old reddit posts?
07:14 🔗 ivan` JetBalsa: https://archive.org/details/2015_reddit_comments_corpus
07:15 🔗 JetBalsa Approximately 350,000 comments out of ~1.65 billion were unavailable
07:16 🔗 JetBalsa Also, I was thinking of entire threads in context
07:18 🔗 remsen2 has joined #archiveteam
07:18 🔗 ivan` JetBalsa: the dataset there doesn't have a 'parent' for the comments?
07:18 🔗 JetBalsa looks like that's a smaller dataset
07:18 🔗 JetBalsa others exist: https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/
07:18 🔗 R5M has joined #archiveteam
07:19 🔗 * ivan` sees a 'parent' on "Example JSON Block"
07:19 🔗 JetBalsa it's there, I see that now
07:19 🔗 ivan` the page I linked links to that :/
07:19 🔗 JetBalsa GG
07:19 🔗 remsen has quit IRC (Read error: Operation timed out)
07:20 🔗 JetBalsa I read the "300k out of" sentence backwards, sorry about that
07:20 🔗 MMovie1 has quit IRC (Read error: Connection reset by peer)
07:22 🔗 MMovie has joined #archiveteam
07:22 🔗 indrora you could theoretically bruteforce the entire reddit object ID space
07:23 🔗 indrora Given that basically everything in reddit is a K:V
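indrora's idea of walking the reddit object ID space can be illustrated with a small sketch. It assumes reddit IDs are base-36 strings that count upward (the thread ID `3bxlg7` in the URL above has that shape); the helper names `to_base36` and `id_range` are hypothetical, and the sketch only generates candidate IDs without fetching anything.

```python
import string

# Digits 0-9 followed by a-z: the 36 symbols of a base-36 ID.
ALPHABET = string.digits + string.ascii_lowercase


def to_base36(n):
    """Convert a non-negative integer to a lowercase base-36 string."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return "0"
    digits = []
    while n:
        n, remainder = divmod(n, 36)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


def id_range(start_id, count):
    """Yield `count` consecutive base-36 IDs beginning at `start_id`."""
    start = int(start_id, 36)
    for n in range(start, start + count):
        yield to_base36(n)
```

Each generated ID would still have to be requested from the site, so any real crawl of this kind would be bounded by rate limits rather than by the enumeration itself.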
07:23 🔗 remsen2 has quit IRC (Read error: Operation timed out)
07:24 🔗 R5M has quit IRC (Leaving)
07:28 🔗 godane SketchCow: i'm looking at the mac addict magazines
07:28 🔗 godane and i think most of the covers have to be rescanned
07:28 🔗 godane other than that they're very good
07:29 🔗 godane i only say that cause they looked a bit cut off on the left with a lot of the covers
07:51 🔗 bwn_ has joined #archiveteam
07:57 🔗 tree3 has quit IRC (Read error: Operation timed out)
07:57 🔗 JesseW has quit IRC (Leaving.)
08:07 🔗 remsen has joined #archiveteam
08:13 🔗 WinterFox has quit IRC (Read error: Operation timed out)
08:16 🔗 WinterFox has joined #archiveteam
08:45 🔗 midas1 is now known as midas
08:45 🔗 midas yes!
08:59 🔗 JetBalsa has quit IRC (Read error: Connection reset by peer)
09:29 🔗 cadbury has quit IRC (Read error: Operation timed out)
09:36 🔗 schbirid has joined #archiveteam
09:46 🔗 BlueMaxim has quit IRC (Quit: Leaving)
10:22 🔗 wutno has joined #archiveteam
10:24 🔗 WapCapLet has quit IRC (Read error: Operation timed out)
10:36 🔗 blergh- has quit IRC (Remote host closed the connection)
11:16 🔗 vitzli has joined #archiveteam
12:07 🔗 PurpleSym xmc: Did the regex for gitorious work?
12:58 🔗 dashcloud has quit IRC (Read error: Operation timed out)
13:01 🔗 dashcloud has joined #archiveteam
13:14 🔗 Billy_ has joined #archiveteam
13:14 🔗 Billy__ has joined #archiveteam
13:15 🔗 * Billy__ slaps dashcloud around a bit with a large fishbot
13:18 🔗 Billy_ has quit IRC (Ping timeout: 240 seconds)
13:19 🔗 Billy__ has quit IRC (Ping timeout: 240 seconds)
13:26 🔗 RichardG has joined #archiveteam
13:28 🔗 REiN^ has joined #archiveteam
13:36 🔗 melody has joined #archiveteam
13:45 🔗 philpem has joined #archiveteam
14:18 🔗 WinterFox has quit IRC (Read error: Operation timed out)
14:20 🔗 WinterFox has joined #archiveteam
14:36 🔗 Rickster has joined #archiveteam
14:40 🔗 Stiletto has quit IRC ()
14:42 🔗 nertzy has joined #archiveteam
14:59 🔗 Stiletto has joined #archiveteam
15:02 🔗 K4k_ Anyone know if the BYTE magazine archive on archive.org is available in a zip or tar format somewhere? I'd like the whole collection but I don't want to have to get every issue separately.
15:05 🔗 Vito`__ K4k_: I don't know if there's a separate item for the whole thing, but you can use their API to get a whole collection: https://emerging.commons.gc.cuny.edu/2014/03/downloading-items-internet-archive-collection-using-python/
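The approach Vito` points to (asking archive.org's search API for every item in a collection, then fetching each item) might look roughly like the sketch below. It is not the code from the linked article. The function names are hypothetical, and the collection identifier `byte-magazine` used in the usage note is an assumption that should be checked against the collection's URL.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

SEARCH_ENDPOINT = "https://archive.org/advancedsearch.php"


def build_search_url(collection, rows=100, page=1):
    """Build an advancedsearch query returning item identifiers as JSON."""
    params = [
        ("q", f"collection:{collection}"),
        ("fl[]", "identifier"),
        ("rows", rows),
        ("page", page),
        ("output", "json"),
    ]
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


def list_identifiers(collection, rows=100):
    """Yield item identifiers page by page until a page comes back empty."""
    page = 1
    while True:
        with urlopen(build_search_url(collection, rows, page)) as resp:
            docs = json.load(resp)["response"]["docs"]
        if not docs:
            return
        for doc in docs:
            yield doc["identifier"]
        page += 1


def download_url(identifier):
    """Return the URL of an item's file listing on archive.org."""
    return f"https://archive.org/download/{identifier}"
```

Something like `for ident in list_identifiers("byte-magazine"): print(download_url(ident))` would then give a list of item URLs to feed to a downloader such as wget.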
15:05 🔗 Vito`__ is now known as Vito`
15:26 🔗 bauruine has quit IRC (Ping timeout: 252 seconds)
15:27 🔗 nertzy has quit IRC (Quit: This computer has gone to sleep)
15:56 🔗 andrewf has joined #archiveteam
15:56 🔗 tree3 has joined #archiveteam
16:03 🔗 andrewf has quit IRC (Quit: Page closed)
16:33 🔗 bauruine has joined #archiveteam
16:44 🔗 remsen2 has joined #archiveteam
16:44 🔗 remsen2 has quit IRC (Remote host closed the connection)
16:49 🔗 remsen has quit IRC (Read error: Operation timed out)
16:52 🔗 Woflie has joined #archiveteam
16:53 🔗 Woflie o7 http://furnation.com/ o7 *Bugles out some taps, disappears*
16:53 🔗 Woflie has quit IRC (Client Quit)
16:54 🔗 remsen has joined #archiveteam
17:17 🔗 Atluxity would anyone care to help me find (if possible) the files for https://web.archive.org/web/20110302231052/http://wano.blip.tv/posts?view=archive&nsfw=dc
17:18 🔗 VADemon has joined #archiveteam
17:18 🔗 Atluxity I read somewhere that all of blip.tv was archived somehow
17:18 🔗 Atluxity but I can't seem to find them
17:18 🔗 arkiver I'll have a look at it
17:19 🔗 Atluxity thanks
17:19 🔗 Atluxity as far as I remember blip.tv also has an option for "upload this file to archive.org as well?" and I think that option was used
17:22 🔗 arkiver so it looks like the IDs of the videos of https://web.archive.org/web/20110302231052/http://wano.blip.tv/posts?view=archive&nsfw=dc are not the same as on the normal blip.tv site
17:22 🔗 arkiver We did not archive wano.blip.tv, we only archived blip.tv
17:22 🔗 arkiver But if the videos from wano are also on blip.tv, they should be saved
17:23 🔗 Atluxity I think they used to be blip.tv/wano
17:23 🔗 Atluxity or similar
17:26 🔗 arkiver I can't find the videos. The IDs are not the same as on blip.tv.
17:26 🔗 arkiver It's from 2011 though
17:26 🔗 arkiver It's very possible that these were already gone when we started archiving
17:29 🔗 Atluxity right
17:29 🔗 Atluxity ok, thanks for trying
17:42 🔗 vitzli has quit IRC (Leaving)
17:51 🔗 remsen2 has joined #archiveteam
17:52 🔗 remsen2 has quit IRC (Remote host closed the connection)
17:57 🔗 remsen has quit IRC (Read error: Operation timed out)
18:07 🔗 remsen has joined #archiveteam
18:09 🔗 remsen has quit IRC (Client Quit)
18:10 🔗 remsen has joined #archiveteam
18:12 🔗 K4k_ Vito`: Thanks, I didn't know they had an API for that kind of operation. I will look into it!
18:14 🔗 xmc PurpleSym: gitorious is ready but i have to do some sysadmining on the backend still
18:19 🔗 nertzy has joined #archiveteam
18:19 🔗 PurpleSym Alright.
18:47 🔗 arkiver SketchCow: I'm not sure how much free space FOS has right now, but a lot of new FTP data is currently coming in
18:54 🔗 SketchCow I am aware. We're holding out (under 50% use) but that 50% use is one day's downloads, so that's pretty involved.
19:03 🔗 nertzy has quit IRC (Quit: This computer has gone to sleep)
19:11 🔗 bwn_ has quit IRC (Read error: Operation timed out)
19:25 🔗 arkiver2 has joined #archiveteam
19:25 🔗 aaaaaaaaa has joined #archiveteam
19:25 🔗 swebb sets mode: +o aaaaaaaaa
19:28 🔗 xmc has quit IRC (Quit: brb rebooting)
19:29 🔗 cadbury has joined #archiveteam
19:32 🔗 SketchCow *** WHO HERE IS DKL3 ON ARCHIVE
19:35 🔗 arkiver2 has quit IRC (Ping timeout: 252 seconds)
19:36 🔗 arkiver I see DKL3 uploaded a lot of WARCs
19:36 🔗 arkiver what's the problem with them?
19:38 🔗 arkiver https://trello.com/dkl31 from Bibliotheca Anonoma
19:40 🔗 bwn has joined #archiveteam
19:42 🔗 xmc has joined #archiveteam
19:42 🔗 swebb sets mode: +o xmc
19:44 🔗 arkiver antonizoo might know more about who dkl3 is
19:49 🔗 SketchCow No, no.
19:49 🔗 SketchCow The upshot is they can upload WARCs but they're not going into Wayback.
19:53 🔗 arkiver2 has joined #archiveteam
20:00 🔗 BlueMaxim has joined #archiveteam
20:00 🔗 arkiver2 has quit IRC (Ping timeout: 252 seconds)
20:04 🔗 arkiver2 has joined #archiveteam
20:24 🔗 ndiddy has joined #archiveteam
20:25 🔗 K4k_ has quit IRC (Quit: WeeChat 1.3)
20:33 🔗 arkiver2 has quit IRC (Ping timeout: 252 seconds)
21:01 🔗 JetBalsa has joined #archiveteam
21:01 🔗 JetBalsa What's the current status of the Yuko project? I have not gotten any new items in 24 hours
21:07 🔗 JetBalsa Yuku*
21:08 🔗 phuzion JetBalsa: Doesn't appear to be giving out new items at this time for one reason or another.
21:24 🔗 JetBalsa Term1T3rm1n@l1!
21:24 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
21:27 🔗 RichardG has joined #archiveteam
21:30 🔗 arkiver2 has joined #archiveteam
21:57 🔗 WinterFox has quit IRC (Read error: Operation timed out)
22:00 🔗 WinterFox has joined #archiveteam
22:06 🔗 Ghost_of_ has joined #archiveteam
22:19 🔗 arkiver2 has quit IRC (Quit: Nettalk6 - www.ntalk.de)
22:26 🔗 dashcloud JetBalsa: you may want to change that password now that it's out in public
22:27 🔗 JetBalsa It's a temp that I give out to nerds. It's meant to be changed, but good thing it's not on any public systems
22:27 🔗 JetBalsa GG, Damn you KeePass
22:27 🔗 dashcloud just wanted to make sure you knew that it was pasted here
22:27 🔗 JetBalsa Ya
22:28 🔗 melody has quit IRC (Read error: Operation timed out)
22:28 🔗 antonizoo Hello. I noticed that I was mentioned here.
22:29 🔗 antonizoo Regarding, specifically, DKL3's WARCs.
22:30 🔗 melody has joined #archiveteam
22:31 🔗 antonizoo I can answer any questions about it, because we have wanted to contact the Internet Archive directly regarding the upload of these WARCs for a while
22:33 🔗 antonizoo Wherever you'd like to ask please ping me
22:33 🔗 antonizoo Or pm
22:50 🔗 arkiver ------------------------------------------
22:50 🔗 arkiver The Google Code project has started!
22:50 🔗 arkiver Join #googlecodeblue
22:50 🔗 arkiver ------------------------------------------
22:50 🔗 Atluxity oh yeah
23:01 🔗 Ghost_of_ has quit IRC (Quit: Leaving)
23:01 🔗 Ghost_of_ has joined #archiveteam
23:49 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
23:49 🔗 dashcloud has joined #archiveteam
