#archiveteam 2013-07-16,Tue

โ†‘back Search

Time Nickname Message
00:57 ๐Ÿ”— SketchCow OK, who wants to wget Hackaday?
01:19 ๐Ÿ”— godane SketchCow: i did that
01:20 ๐Ÿ”— godane SketchCow: see here: https://archive.org/search.php?query=hackaday
01:21 ๐Ÿ”— godane cleaner here: https://archive.org/search.php?query=collection%3A%22archiveteam-fire%22%20hackaday
01:30 ๐Ÿ”— winr4r godane: yo, feel like grabbing help.snapjoy.com and blog.snapjoy.com?
01:35 ๐Ÿ”— godane i'm mirroring snapjoy.com site and all sub dumps linked from there
01:36 ๐Ÿ”— winr4r godane: huuuuug
01:40 ๐Ÿ”— godane winr4r: I'm uploading it now
01:40 ๐Ÿ”— godane was only 14mb
01:41 ๐Ÿ”— SketchCow Bravo, godane. I've been getting enquiries.
01:42 ๐Ÿ”— godane looks like cloudfront.net hosts images of snapjoy users
01:42 ๐Ÿ”— winr4r godane: yup, we're on the case
01:47 ๐Ÿ”— godane uploaded: https://archive.org/details/snapjoy.com-20130715
01:48 ๐Ÿ”— godane looks like the feedback.snapjoy.com forums are gone
01:48 ๐Ÿ”— godane it redirects to the main site
01:55 ๐Ÿ”— winr4r godane: thanks :D
01:59 ๐Ÿ”— dashcloud SketchCow: what worries you the most about the hackaday plans?
01:59 ๐Ÿ”— godane that the archive of posts could disable
01:59 ๐Ÿ”— godane *disappear
02:00 ๐Ÿ”— godane i'm up to 2013-06 with hackaday
02:02 ๐Ÿ”— godane does anyone know how to make grep stop a grep at another patten line?
02:02 ๐Ÿ”— winr4r godane: explain
02:02 ๐Ÿ”— godane my idea is to grab gbtv/theblaze video key
02:03 ๐Ÿ”— godane but i'm always going to over grab
02:03 ๐Ÿ”— godane some of the xml data has a lot of keyworks
02:03 ๐Ÿ”— godane *keywords
02:03 ๐Ÿ”— godane so a fix -A20 of something may not work aways
02:04 ๐Ÿ”— godane *always
02:12 ๐Ÿ”— winr4r hm
02:17 ๐Ÿ”— winr4r i'm not sure you can with grep
02:19 ๐Ÿ”— godane it looks like the first 5 links work for me for most of the data
02:53 ๐Ÿ”— godane winr4r: i got it to work
02:54 ๐Ÿ”— godane i had to new line variables after find the video key
02:54 ๐Ÿ”— godane since the video key with everything is one line
02:55 ๐Ÿ”— godane i will not get any other data
02:55 ๐Ÿ”— winr4r ah :)
03:55 ๐Ÿ”— SketchCow Anyone grabbed ftp.atari.com?
04:03 ๐Ÿ”— SketchCow I'm grabbing it.
04:04 ๐Ÿ”— * winr4r salutes
04:05 ๐Ÿ”— SketchCow Man, this Manga collection I'm adding is just so much Yaoi
04:05 ๐Ÿ”— SketchCow I think it's possibly because I'm in the A's only so far, and that's got words Yaoi tends to use.
04:19 ๐Ÿ”— DFJustin aaan~
04:27 ๐Ÿ”— wp494 two things:
04:27 ๐Ÿ”— wp494 1. it would be appreciated if we get a #76days archivist for when things come up
04:27 ๐Ÿ”— wp494 (on freenode)
04:27 ๐Ÿ”— wp494 and 2. still looking for some coders in #pushharder
04:28 ๐Ÿ”— wp494 (here on EFNet)
04:29 ๐Ÿ”— wp494 (#76days is an investigation of recent happenings on the pronounciationbook youtube channel)
04:34 ๐Ÿ”— xmc care to provide some more background for the reprobates among us who don't know what that is?
04:42 ๐Ÿ”— wp494 76days?
04:42 ๐Ÿ”— wp494 https://docs.google.com/document/d/1UamrCTSCj7IleTVnxNn2mCGX7AsLC-AlOGAYghsKZA0
04:42 ๐Ÿ”— wp494 tldr pronounciationbook (the YT channel) has begun counting down each day from 76 a few days ago
04:42 ๐Ÿ”— wp494 currently at 71
04:42 ๐Ÿ”— wp494 4chan, other conspiracy groups investigating
04:43 ๐Ÿ”— wp494 the reason I bring it up here is because IIRC they found a vimeo page related to it, but its videos were deleted shortly afterwards
04:44 ๐Ÿ”— xmc oh it's one of those internet game things
04:45 ๐Ÿ”— winr4r aka "you're being trolled"
04:48 ๐Ÿ”— wp494 [23:45:21.721] <winr4r> aka "you're being trolled"
04:48 ๐Ÿ”— wp494 there's some speculation that it's been in the works for 4+ years
04:48 ๐Ÿ”— wp494 but only time will tell
04:49 ๐Ÿ”— winr4r said wp494 in the voice of einstein in the intro video for red alert 1
07:30 ๐Ÿ”— vba any word on whether PACER makes an effort to track down people with multiple accts, bringing each one up to just under the limit for not being billed?
08:31 ๐Ÿ”— alih-duck ร‚ยด
09:33 ๐Ÿ”— SketchCow FINISHED --2013-07-16 08:17:54--
09:33 ๐Ÿ”— SketchCow Total wall clock time: 4h 14m 33s
09:33 ๐Ÿ”— SketchCow Downloaded: 2036 files, 27G in 4h 3m 1s (1.87 MB/s)
09:34 ๐Ÿ”— ersi wroom
09:35 ๐Ÿ”— SketchCow zip -9 -r ftp.atari.com.2013.07.zip ftp.atari.com
09:35 ๐Ÿ”— Smiley Nice
09:35 ๐Ÿ”— SketchCow That'll take a while.
09:35 ๐Ÿ”— Smiley 1 file left to upload in pouet.com_full_grab
09:35 ๐Ÿ”— Smiley 90% done :D
10:52 ๐Ÿ”— Smiley more news on the hack-a-day buy/sell thing
10:52 ๐Ÿ”— Smiley http://hackaday.com/2013/07/15/were-going-to-buy-hackaday/
13:47 ๐Ÿ”— Nemo_bis Any update on the identi.ca deleted stuff being brought to archive.org?
16:31 ๐Ÿ”— SketchCow xmc: need your help in #jenga
19:59 ๐Ÿ”— WiK hello world
20:00 ๐Ÿ”— WiK omf_: finally got a NAS for all these repos, 16x harddrive bays
20:01 ๐Ÿ”— ivan` :-)
20:01 ๐Ÿ”— ivan` WiK: I might write some software that lets you store more repos
20:01 ๐Ÿ”— WiK now i just need to get some harddrives for in it
20:01 ๐Ÿ”— WiK ive got a lic copy of unraid for it as well
20:02 ๐Ÿ”— ivan` the git objects need to be stored uncompressed (but still packed) and the whole repo needs to be LZMA2'ed
20:02 ๐Ÿ”— ivan` git uses zlib which isn't so great
20:02 ๐Ÿ”— WiK well, dont know how well that would work, since im gonna allow ppl to submit egrep/grep strings to run on the data
20:03 ๐Ÿ”— WiK doing that wouldnt screw that up would it?
20:03 ๐Ÿ”— WiK http://wik-i-pedia.com/gitdigger
20:04 ๐Ÿ”— WiK 16x 4TB drives should give me more space then ill ever need
20:04 ๐Ÿ”— WiK or at least a mixture of 4TB and 2TB drives
20:04 ๐Ÿ”— ivan` WiK: I thought you were using git --mirror which stored just the git objects that you can't grep anyway
20:04 ๐Ÿ”— ivan` are you going to git-grep?
20:04 ๐Ÿ”— ivan` it would take quite a whole to grep everything
20:04 ๐Ÿ”— WiK im just doing a git clone
20:04 ๐Ÿ”— WiK and it does take quite awhile
20:05 ๐Ÿ”— WiK unless you multi-thread your grep
20:05 ๐Ÿ”— ivan` building a useful code search is much harder than storing as many repos as possible
20:05 ๐Ÿ”— ivan` github uses a large ElasticSearch cluster
20:07 ๐Ÿ”— WiK ya, i was gonna give that a shot, but i dont really plan on making this open to the public, so i dont really need 'fast'
20:07 ๐Ÿ”— ivan` also doesn't github already let you search all the github repos? ;)
20:08 ๐Ÿ”— ivan` well, the ones that haven't been deleted
20:08 ๐Ÿ”— WiK no, not with security related searches
20:08 ๐Ÿ”— ivan` ah
20:10 ๐Ÿ”— ivan` http://swtch.com/~rsc/regexp/regexp4.html https://code.google.com/p/codesearch/ is basically what Google Code Search did
20:11 ๐Ÿ”— ivan` you can build a trigram index for all the source files you have
20:13 ๐Ÿ”— WiK very interesting reading
20:13 ๐Ÿ”— WiK thanks
20:29 ๐Ÿ”— WiK now, to figure out how to make this index and keep it updated
20:30 ๐Ÿ”— WiK ahh i see, codesearch does that for you
21:45 ๐Ÿ”— ersi WiK: What NAS did ya' get?
21:45 ๐Ÿ”— ersi ElasticSearch is pretty nifty by the way
22:08 ๐Ÿ”— omf_ one problem down WiK :)
22:17 ๐Ÿ”— Nemo_bis ersi: you like it better than Solr?
22:17 ๐Ÿ”— Nemo_bis they're currently being considered for Wikimedia projects https://www.mediawiki.org/wiki/Requests_for_comment/CirrusSearch

irclogger-viewer