#archiveteam 2013-03-19,Tue


Time Nickname Message
00:07 grawity What's the user-agent set to – "Jason/Scott/1.0"?
00:16 GLaDOS It's a default Safari UA I believe
01:06 SketchCow http://www.slate.com/articles/technology/map_of_the_week/2013/03/google_reader_joins_graveyard_of_dead_google_products.html
02:00 * BlueMaxim looks at all these dead Google products and starts crying
02:01 gui77 hey guys, i'm back
02:05 gui77 any idea why my warrior for yahoo messages hasn't been working? it either says "Starting reaper", "no tasks available", or "Service blocked us 1 times, backing off for 5 seconds"..
03:08 Samuel_Mi SketchCow: pls do post the url of the HuffPo interview when it's available
04:47 no2pencil any activity for the phoenix?
04:56 DFJustin https://archive.org/details/archiveteam_blog_thephoenix
05:02 S[h]O[r]T SketchCow, id go out and get a hdd..or just donate $$ for you for a hdd for punchfork. i just dont want to be responsible for holding the only single copy on 1 hdd for the rest of enterinity and if something happens to it..well shit id feel really bad and it would suck
05:26 wp494 any news on the formspring project (particularly if anyone has heard back from tim yet)?
05:27 wp494 #firespring is dead as a doorknob
05:43 S[h]O[r]T http://www.techdirt.com/articles/20130317/16534822353/drm-strikes-again-digital-comics-distributor-jmanga-closing-down-deleting-everyones-purchases.shtml
05:43 S[h]O[r]T wp494 i dont think anyone has..i guess we need to look at scraping to at least get started
05:44 hdevalenc what does one yahoo messages "item" correspond to?
05:45 CoJaBo S[h]O[r]T: lol'd
05:51 chronomex hdevalenc: perhaps a topic/thread
05:53 wp494 @Sh0rt: someone should get on it ASAP in the event of worst case scenario where tim can't get a userlist
06:00 omf_ SketchCow, I now have the space for punchfork if you still want another copy out there
07:32 SketchCow No, wehave a plan.
07:32 SketchCow They try and make it go crazy
07:32 SketchCow We just spirit it out
07:35 SketchCow Oh man, megaupload
07:41 SketchCow Yahoo! Korea now completely uploaded - thanks everyone.
07:41 chronomex yahoo.kp
07:47 SketchCow Now just verifying Punchfork uploading.
07:47 SketchCow These 50gb blocks sometimes clog in the system.
07:48 chronomex shocked
08:05 SketchCow Hooray, I've gotten one of the FOS drives to 10 percent.
08:11 SketchCow 210458.2 / 583096.0 MB Rate: 23787.6 / 3276.1 KB Uploaded: 759839.0 MB [36%] 1d 8:21 [ R: 3.61]
08:11 SketchCow InternetCensus2012
08:11 SketchCow 210gb
08:12 SketchCow And dude.... 759gb uploaded. What a good citizen.
08:13 chronomex like a boss
08:17 SketchCow Punchfork is so active, we have 39gb of what, as far as I can tell, is stuff stuck on the side of the tubes.
08:17 SketchCow I'll overlook it for now.
08:18 C-Keen run little warrior!
08:20 SketchCow Yeah, I'm happy - we're at the good point now on that drive. It's about 9tb free now. That's a lot of space.
08:21 SketchCow The other has 7.8tb drives.
08:24 C-Keen SketchCow: how are dumps usually processed and made accessible in IA?
08:24 SketchCow Which dumps.
08:24 C-Keen like of posterous
08:25 SketchCow Dumps where we've created, say, hundreds of gigabytes of grabs, we split them into 50gb chunks, and upload those into the archive in the form of a "MegaWARC", which is a WARC of WARCS.
08:25 C-Keen to rephrase the question: what does happen with the archive teams uploads and how do they get integrated into the IA's searchable data stuff
08:26 SketchCow By calling it a web object, and including the index we do, a process stops by and ingests the data into the Wayback.
08:26 SketchCow There used to be an embargo between the addition of data and Wayback availability. Current version doesn't have that.
08:26 SketchCow I'm trying to think of an example.
08:27 C-Keen I see. So modern web-archeologists can then either download the WARCs or use the wayback machine
08:27 SketchCow Yes.
08:27 SketchCow We have conversion utilities for taking these WARCs and making them not be WARCs. To be zips, or browsable.
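
The "split them into 50gb chunks" step described at 08:25 can be sketched roughly as below. This is only an illustration of the grouping idea, not Archive Team's actual tooling: the function name `chunk_by_size`, the file names, and the decimal-gigabyte limit are all assumptions, and the sketch does not build a MegaWARC itself.

```python
GB = 1000 ** 3            # assumption: decimal gigabytes
CHUNK_LIMIT = 50 * GB     # the "50gb chunks" mentioned in the log


def chunk_by_size(files, limit=CHUNK_LIMIT):
    """Group (name, size_in_bytes) pairs, in order, into batches
    whose total size stays at or under `limit`.

    A single file larger than `limit` ends up in a batch of its own.
    """
    chunks = []
    current = []
    current_size = 0
    for name, size in files:
        # Close the current batch if adding this file would overflow it.
        if current and current_size + size > limit:
            chunks.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    # Hypothetical grab files, for illustration only.
    grabs = [
        ("grab-0001.warc.gz", 30 * GB),
        ("grab-0002.warc.gz", 25 * GB),
        ("grab-0003.warc.gz", 20 * GB),
    ]
    for i, batch in enumerate(chunk_by_size(grabs)):
        print(i, batch)
```

Each resulting batch would then be packed and uploaded as one item; how that packing is done is outside what the log describes.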
08:28 C-Keen I also have seen some local webserver thingy that serves html pages out of a warc somewhere on github
08:29 SketchCow But that's the current situation.
08:29 SketchCow We're going to be more aggressive with putting out tools.
08:29 SketchCow And other "archive the archiving archivers" stuff.
08:29 C-Keen do you have a list of things you need help with?
08:30 C-Keen * or want
08:30 SketchCow Like, projects? Or things?
08:30 C-Keen whatever I potentially could do from the other side of your world
08:32 SketchCow I have boring things that I think change the nature of the world, but they're not 100% archive team projects.
08:32 SketchCow Archive Team's biggest issue is Wiki updating.
08:38 Cameron_D I'd love to do stuff to the wiki, but I have no idea where to start
08:40 SketchCow chronomex? Any suggestions?
08:41 chronomex hi
08:41 chronomex what about
08:41 chronomex borign thigns to change the world?
08:41 chronomex or at wiki janitor stuff?
08:41 Cameron_D just where to start editing/what needs to be done
08:41 chronomex hm
08:42 chronomex a bunch of the project-specific pages are stale w.r.t. status, disposition, etc
08:42 SketchCow BORING THINGS TO CHANGE THE WORLD would be a great slogan at the bottom of a crest.
08:42 chronomex yes
08:43 chronomex the archiveteam wiki is full of stale info, mostly
08:43 SketchCow Yes, I'd like it saved off into a way that we can still get to it, but it's marked as completed.
08:44 chronomex right
09:06 Cameron_D If the wiki prompts you to fill a captcha for anti-spam there is also a PHP error on the page
09:06 Cameron_D Warning: file_get_contents(/home/archivet/public_html/extensions/SpamBlacklist/wikimedia_blacklist) [function.file-get-contents]: failed to open stream: No such file or directory in /home/archivet/public_html/extensions/SpamBlacklist/SpamBlacklist_body.php on line 123
09:07 Cameron_D and that was poorly worded, the wiki prompted me to fill out a captcha becuase I included external links and it threw that error on the page
09:10 chronomex our wiki?
09:18 m1force http://www.filedropper.com/plutonomyandalgorithms | Professional research
11:24 no2pencil I wrote the following last night, for the phoenix : http://pastebin.com/Gsm5Qhg2
11:24 no2pencil doesn't grab photos.... code isn't the greatest, actually rather poor, but it's a work in progress.
12:50 SketchCow In WTF news, a hacker turned FBI informant from 30 years ago has sent me a box of Apple II Floppies
12:50 SketchCow Hilarious Tools Galore
12:53 SketchCow Also, good news, I have a machine I can WGET 1.14 on
12:53 SketchCow So more mirrors going in
12:53 ersi Neat.
12:56 no2pencil SketchCow: this is something you will put on archive.org? The Apple II Floppies/Tools?
12:58 SketchCow Ultimately, I expect so
12:58 SketchCow So, since I've partnered with TOSEC, I may begin putting images up on archive.org or another location, to ensure they're publically accessible
12:58 SketchCow And they can officially enter the pantheon
13:01 godane so IA is doing maintenance again
13:03 SketchCow No, there's a problem and we're dealing with it.
13:08 godane ok
13:09 GLaDOS Has anyone really been far even as decided to use even go want to do look more like?
13:10 GLaDOS Erm, crap, wrong button.
13:10 GLaDOS Has anyone else attempted to grab the Saints Row 3 community items, etc. on http://saintsrow.com?
13:11 omf_ GLaDOS, is a cyclon
13:11 omf_ the nlp filter failed
13:11 GLaDOS shh
13:19 GLaDOS But yeah, community creations on the Saints Row website
13:19 GLaDOS There is a way to get a search to list them all, but their search function injects the results after pageload
13:23 SmileyG yeah :(
13:23 SmileyG I'm trying to figure out the wget for the forums, however those community items are a pita.
15:35 SketchCow Also Has anyone really been far even as decided to use even go want to do look more like?
15:40 RedType ou've got to be kidding me. I've been further even more decided to use even go need to do look more as anyone can. Can you really be far even as decided half as much to use go wish for that?
16:50 SketchCow Some here are the last thoughts for them and here are so look like!
16:50 SketchCow http://www.flickr.com/photos/textfiles/8572533274/in/photostream/lightbox/
16:59 omf_ Since pingmag.jp came back I did a full site grab which has all their old content from 2005-2008 http://archive.org/details/PingmagArtDesignLife
17:27 * SmileyG looks at what RedType and SketchCow said and is so confused.
17:40 RedType SmileyG: it's a meme
17:40 RedType he acked, i syned
17:41 SmileyG ah ok
17:44 Samuel_Mi http://knowyourmeme.com/memes/has-anyone-really-been-far-even-as-decided-to-use-even-go-want-to-do-look-more-like
17:47 omf_ 4chandata is going again. I am getting 366,720 more images
17:47 SmileyG :O
17:47 omf_ and then the database dump
17:47 SmileyG i'm currently headbutting against GLaDOS's dedi he is letting me play with
17:47 SmileyG tim@MushaV3 ~ $ scp ./cookies.txt -v root@37.56.60.160:~
17:48 SmileyG not sure why it's not doing anyhting
17:48 SmileyG Oh, it's a 9, not 6 ¬_¬
17:49 omf_ typos are the mind killer
17:49 SmileyG yes, yes they are.
17:49 SmileyG shit, wrong wget on there :S
17:49 SmileyG time to go to -bs
19:12 Hooch ESET just detected a virus from the virtualbox http://i1140.photobucket.com/albums/n567/rand4505/issue.jpg
19:15 SketchCow ha ha
19:15 SketchCow Yeah don't browse that posterous site. Viruses ahoy
22:25 chronomex freenode -- | RichiH (~richih@freenode/staff/richih): [Global Notice] Hi all. PDPC, freenode’s parent organisation, has been dissolved. Details can be found at http://blog.freenode.net/ with a static copy at http://planet.freenode.net/
22:25 chronomex freenode -- | mrmist (~mrmist@freenode/staff/mrmist): [Global Notice] As a P.S. to the last global, no, we're not dying or going away. If the blog is down, you can read details at https://plus.google.com/b/104326727082310562426/104326727082310562426/posts/CMW4Gst657v thanks for flying freenode!
22:25 chronomex huh
