Time |
Nickname |
Message |
00:57
|
SketchCow |
OK, who wants to wget Hackaday? |
01:19
|
godane |
SketchCow: i did that |
01:20
|
godane |
SketchCow: see here: https://archive.org/search.php?query=hackaday |
01:21
|
godane |
cleaner here: https://archive.org/search.php?query=collection%3A%22archiveteam-fire%22%20hackaday |
01:30
|
winr4r |
godane: yo, feel like grabbing help.snapjoy.com and blog.snapjoy.com? |
01:35
|
godane |
i'm mirroring the snapjoy.com site and all subdomains linked from there |
01:36
|
winr4r |
godane: huuuuug |
01:40
|
godane |
winr4r: I'm uploading it now |
01:40
|
godane |
was only 14mb |
01:41
|
SketchCow |
Bravo, godane. I've been getting enquiries. |
01:42
|
godane |
looks like cloudfront.net hosts images of snapjoy users |
01:42
|
winr4r |
godane: yup, we're on the case |
01:47
|
godane |
uploaded: https://archive.org/details/snapjoy.com-20130715 |
01:48
|
godane |
looks like the feedback.snapjoy.com forums are gone |
01:48
|
godane |
it redirects to the main site |
01:55
|
winr4r |
godane: thanks :D |
01:59
|
dashcloud |
SketchCow: what worries you the most about the hackaday plans? |
01:59
|
godane |
that the archive of posts could disable |
01:59
|
godane |
*disappear |
02:00
|
godane |
i'm up to 2013-06 with hackaday |
02:02
|
godane |
does anyone know how to make grep stop at another pattern line? |
02:02
|
winr4r |
godane: explain |
02:02
|
godane |
my idea is to grab gbtv/theblaze video key |
02:03
|
godane |
but i'm always going to over grab |
02:03
|
godane |
some of the xml data has a lot of keyworks |
02:03
|
godane |
*keywords |
02:03
|
godane |
so a fixed -A20 of something may not work aways |
02:04
|
godane |
*always |
02:12
|
winr4r |
hm |
02:17
|
winr4r |
i'm not sure you can with grep |
02:19
|
godane |
it looks like the first 5 links work for me for most of the data |
02:53
|
godane |
winr4r: i got it to work |
02:54
|
godane |
i had to put the variables on new lines after finding the video key |
02:54
|
godane |
since the video key with everything is one line |
02:55
|
godane |
i will not get any other data |
02:55
|
winr4r |
ah :) |
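A rough sketch of the "stop at another pattern line" idea asked about above, written in Python rather than grep. It yields lines starting at a match of one pattern and stops at the next line matching a second pattern, so the amount of trailing context is not fixed the way `-A20` is. The function name `lines_until`, the sample XML layout, and the tag names are all invented for illustration; the real gbtv/theblaze data is not shown in the log.

```python
import re
from typing import Iterable, Iterator


def lines_until(lines: Iterable[str], start: str, stop: str) -> Iterator[str]:
    """Yield each line from a `start` match up to, but not including,
    the next line that matches `stop`. Repeats for later `start` matches."""
    start_re, stop_re = re.compile(start), re.compile(stop)
    inside = False
    for line in lines:
        if not inside:
            if start_re.search(line):
                inside = True
                yield line
        elif stop_re.search(line):
            inside = False  # stop pattern reached: drop this line and wait for the next start
        else:
            yield line


# Hypothetical sample data, one tag per line.
sample = """<item>
<video-key>abc123</video-key>
<keyword>news</keyword>
<keyword>politics</keyword>
</item>
<item>
<video-key>def456</video-key>
<keyword>radio</keyword>
</item>""".splitlines()

for line in lines_until(sample, r"<video-key>", r"</item>"):
    print(line)
```

This only helps when the data has line breaks between fields. As noted in the log, when everything sits on one line the fields have to be split onto separate lines first.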
03:55
|
SketchCow |
Anyone grabbed ftp.atari.com? |
04:03
|
SketchCow |
I'm grabbing it. |
04:04
|
* |
winr4r salutes |
04:05
|
SketchCow |
Man, this Manga collection I'm adding is just so much Yaoi |
04:05
|
SketchCow |
I think it's possibly because I'm in the A's only so far, and that's got words Yaoi tends to use. |
04:19
|
DFJustin |
aaan~ |
04:27
|
wp494 |
two things: |
04:27
|
wp494 |
1. it would be appreciated if we get a #76days archivist for when things come up |
04:27
|
wp494 |
(on freenode) |
04:27
|
wp494 |
and 2. still looking for some coders in #pushharder |
04:28
|
wp494 |
(here on EFNet) |
04:29
|
wp494 |
(#76days is an investigation of recent happenings on the pronunciationbook youtube channel) |
04:34
|
xmc |
care to provide some more background for the reprobates among us who don't know what that is? |
04:42
|
wp494 |
76days? |
04:42
|
wp494 |
https://docs.google.com/document/d/1UamrCTSCj7IleTVnxNn2mCGX7AsLC-AlOGAYghsKZA0 |
04:42
|
wp494 |
tldr pronunciationbook (the YT channel) began counting down each day from 76 a few days ago |
04:42
|
wp494 |
currently at 71 |
04:42
|
wp494 |
4chan, other conspiracy groups investigating |
04:43
|
wp494 |
the reason I bring it up here is because IIRC they found a vimeo page related to it, but its videos were deleted shortly afterwards |
04:44
|
xmc |
oh it's one of those internet game things |
04:45
|
winr4r |
aka "you're being trolled" |
04:48
|
wp494 |
[23:45:21.721] <winr4r> aka "you're being trolled" |
04:48
|
wp494 |
there's some speculation that it's been in the works for 4+ years |
04:48
|
wp494 |
but only time will tell |
04:49
|
winr4r |
said wp494 in the voice of einstein in the intro video for red alert 1 |
07:30
|
vba |
any word on whether PACER makes an effort to track down people with multiple accts, bringing each one up to just under the limit for not being billed? |
08:31
|
alih-duck |
´ |
09:33
|
SketchCow |
FINISHED --2013-07-16 08:17:54-- |
09:33
|
SketchCow |
Total wall clock time: 4h 14m 33s |
09:33
|
SketchCow |
Downloaded: 2036 files, 27G in 4h 3m 1s (1.87 MB/s) |
09:34
|
ersi |
wroom |
09:35
|
SketchCow |
zip -9 -r ftp.atari.com.2013.07.zip ftp.atari.com |
09:35
|
Smiley |
Nice |
09:35
|
SketchCow |
That'll take a while. |
09:35
|
Smiley |
1 file left to upload in pouet.com_full_grab |
09:35
|
Smiley |
90% done :D |
10:52
|
Smiley |
more news on the hack-a-day buy/sell thing |
10:52
|
Smiley |
http://hackaday.com/2013/07/15/were-going-to-buy-hackaday/ |
13:47
|
Nemo_bis |
Any update on the identi.ca deleted stuff being brought to archive.org? |
16:31
|
SketchCow |
xmc: need your help in #jenga |
19:59
|
WiK |
hello world |
20:00
|
WiK |
omf_: finally got a NAS for all these repos, 16x harddrive bays |
20:01
|
ivan` |
:-) |
20:01
|
ivan` |
WiK: I might write some software that lets you store more repos |
20:01
|
WiK |
now i just need to get some harddrives to put in it |
20:01
|
WiK |
i've got a licensed copy of unraid for it as well |
20:02
|
ivan` |
the git objects need to be stored uncompressed (but still packed) and the whole repo needs to be LZMA2'ed |
20:02
|
ivan` |
git uses zlib which isn't so great |
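A small sketch of why compressing the whole repo as one stream might beat per-object zlib, assuming the objects are small and resemble each other. The blobs below are made-up stand-ins, not real git objects, and the numbers say nothing about actual repositories. Python's `lzma` module writes `.xz` containers, which use the LZMA2 filter by default.

```python
import lzma
import zlib

# Hypothetical stand-ins for git objects: many small, similar text blobs.
objects = [
    ("def handler_%d(request):\n    return render(request, 'page_%d.html')\n" % (i, i)).encode() * 4
    for i in range(200)
]

# Roughly what git does: every object is deflated on its own with zlib,
# so redundancy shared between objects is never exploited.
per_object = sum(len(zlib.compress(obj)) for obj in objects)

# The idea from the log: keep objects uncompressed and compress everything as one stream.
solid = len(lzma.compress(b"".join(objects)))

print(f"zlib, per object: {per_object} bytes")
print(f"lzma, one stream: {solid} bytes")
```

The trade-off is random access: a single compressed stream has to be decompressed before anything can read or grep an individual object.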
20:02
|
WiK |
well, dont know how well that would work, since im gonna allow ppl to submit egrep/grep strings to run on the data |
20:03
|
WiK |
doing that wouldnt screw that up would it? |
20:03
|
WiK |
http://wik-i-pedia.com/gitdigger |
20:04
|
WiK |
16x 4TB drives should give me more space than i'll ever need |
20:04
|
WiK |
or at least a mixture of 4TB and 2TB drives |
20:04
|
ivan` |
WiK: I thought you were using git clone --mirror, which stores just the git objects, which you can't grep anyway |
20:04
|
ivan` |
are you going to git-grep? |
20:04
|
ivan` |
it would take quite a while to grep everything |
20:04
|
WiK |
im just doing a git clone |
20:04
|
WiK |
and it does take quite awhile |
20:05
|
WiK |
unless you multi-thread your grep |
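One way a multi-threaded grep over checked-out files could look, as a sketch only. The names `grep_file` and `parallel_grep`, the worker count, and the `password` search string are assumptions for illustration and not WiK's actual setup. Python threads mainly overlap file reads; for CPU-heavy patterns a process pool would be the likelier choice.

```python
import re
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def grep_file(path, pattern):
    """Return (file name, line number, line) for every matching line in one file."""
    hits = []
    with path.open(errors="replace") as handle:
        for number, line in enumerate(handle, start=1):
            if pattern.search(line):
                hits.append((path.name, number, line.rstrip("\n")))
    return hits


def parallel_grep(root, expression, workers=8):
    """Search every file under `root`, one file per task, keeping a stable file order."""
    pattern = re.compile(expression)
    files = sorted(p for p in root.rglob("*") if p.is_file())
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda p: grep_file(p, pattern), files)
        return [hit for hits in results for hit in hits]


# Tiny demo on throwaway files.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "a.py").write_text("import os\npassword = 'hunter2'\n")
    (root / "b.py").write_text("print('hello')\n")
    demo_hits = parallel_grep(root, r"password\s*=")

print(demo_hits)
```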
20:05
|
ivan` |
building a useful code search is much harder than storing as many repos as possible |
20:05
|
ivan` |
github uses a large ElasticSearch cluster |
20:07
|
WiK |
ya, i was gonna give that a shot, but i dont really plan on making this open to the public, so i dont really need 'fast' |
20:07
|
ivan` |
also doesn't github already let you search all the github repos? ;) |
20:08
|
ivan` |
well, the ones that haven't been deleted |
20:08
|
WiK |
no, not with security related searches |
20:08
|
ivan` |
ah |
20:10
|
ivan` |
http://swtch.com/~rsc/regexp/regexp4.html https://code.google.com/p/codesearch/ is basically what Google Code Search did |
20:11
|
ivan` |
you can build a trigram index for all the source files you have |
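A toy version of the trigram index idea, to show the shape of it. It indexes every three-character substring of each file, intersects the posting sets for the query's trigrams to get candidate files, and then confirms each candidate with a real substring check. It handles literal strings only; the codesearch tool linked above goes further and derives trigram queries from full regular expressions. The file names and contents here are invented.

```python
from collections import defaultdict


def trigrams(text):
    """Every 3-character substring of `text`."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


def build_index(documents):
    """Map each trigram to the set of document names containing it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for gram in trigrams(text):
            index[gram].add(name)
    return index


def candidates(index, literal, all_names):
    """Documents containing every trigram of `literal` (may include false positives)."""
    result = set(all_names)
    for gram in trigrams(literal):
        result &= index.get(gram, set())
    return result  # literals shorter than 3 characters cannot be filtered, so all names remain


def search(documents, index, literal):
    """Filter with the index first, then verify with an actual substring test."""
    return sorted(name for name in candidates(index, literal, documents)
                  if literal in documents[name])


# Hypothetical corpus.
documents = {
    "auth.py": "password = load_secret()\n",
    "readme.txt": "pass the sword to the next player\n",
    "main.c": "int main(void) { return 0; }\n",
}
index = build_index(documents)
print(search(documents, index, "password"))
```

Only the candidate files are ever opened and scanned, which is what saves time on a large corpus.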
20:13
|
WiK |
very interesting reading |
20:13
|
WiK |
thanks |
20:29
|
WiK |
now, to figure out how to make this index and keep it updated |
20:30
|
WiK |
ahh i see, codesearch does that for you |
21:45
|
ersi |
WiK: What NAS did ya' get? |
21:45
|
ersi |
ElasticSearch is pretty nifty by the way |
22:08
|
omf_ |
one problem down WiK :) |
22:17
|
Nemo_bis |
ersi: you like it better than Solr? |
22:17
|
Nemo_bis |
they're currently being considered for Wikimedia projects https://www.mediawiki.org/wiki/Requests_for_comment/CirrusSearch |