Time |
Nickname |
Message |
00:06
🔗
|
WiK |
sup |
00:06
🔗
|
WiK |
SketchCow: awesome job of the defcon doc |
00:07
🔗
|
WiK |
i was gonna introduce myself, but you were constantly surrounded |
00:21
🔗
|
balrog |
twitter account? https://twitter.com/MartinManleyUFR |
00:21
🔗
|
balrog |
Tephra: & |
00:21
🔗
|
balrog |
^* |
00:45
🔗
|
SketchCow |
Thanks |
01:35
🔗
|
SketchCow |
OK, top priority for me is dumping ALL the data I can off of my Internet Archive machine. |
01:35
🔗
|
SketchCow |
Currently 11 terabytes |
08:32
🔗
|
Tephra |
balrog: sweet! |
16:29
🔗
|
besc |
Hey, how do you guys save/archive articles or websites? I'm talking cutting out pictures of your kid from the newspaper type stuff |
16:32
🔗
|
omf_ |
Zotero is nice for saving a page and displaying it later. Comes as a browser plugin or stand alone |
16:32
🔗
|
besc |
omf_, thanks I'll give it a closer look |
16:34
🔗
|
ersi |
besc: There's also a Firefox addon that can save to "MAF" (Mozilla Archive Format): https://addons.mozilla.org/en-US/firefox/addon/mozilla-archive-format/ |
16:34
🔗
|
besc |
Hm, this looks too much like web 2.0 to me. Anything more old school? |
16:35
🔗
|
besc |
Ideally I want to save everything I deem worthy of saving using X system and be able to organize it on a file system level and search later |
16:36
🔗
|
antomatic |
There are command-line tools such as wget which might do the trick for you, besc? |
16:36
🔗
|
ersi |
I can't come up with any ideas for that unfortunately :/ Unless you use HTTrack/wget and get a bunch of files |
16:36
🔗
|
besc |
Can you search through WARC files? |
16:39
🔗
|
antomatic |
I'm sure it's possible but I don't know how. Alternatively you can use wget just to grab a page and the files (e.g. images, css sheets) that make it up, and have those stored as, say, files in a directory - so you can search or process them as any other. |
16:40
🔗
|
besc |
True. Well, I'll have to do some research :P |
17:41
🔗
|
besc |
Vanity Fair has a pretty weird pagination format ( http://www.vanityfair.com/style/2013/09/andre-leon-talley-fashion-profile for example) where the pages are hidden and made visible using javascript |
17:41
🔗
|
besc |
Would wget/warc still cover this? |
17:42
🔗
|
omf_ |
nope. wget does not process javascript |
17:43
🔗
|
besc |
The content is there, just made visible using JS. So wget should save it and I guess the WARC viewer would have to support JS |
17:46
🔗
|
omf_ |
besc, still won't work |
17:46
🔗
|
omf_ |
they are using JS to create the links and drive them, not just hide them |
17:49
🔗
|
besc |
Oh ok |
18:03
🔗
|
besc |
Actually it's good that wget doesn't run javascript, this means I get the full page without pagination - something I just found out |
18:23
🔗
|
WiK |
afternoon |
18:29
🔗
|
ersi |
evenin' |
18:48
🔗
|
WiK |
hows it going ersi ? |
18:49
🔗
|
ersi |
Pretty good, uploading DebianConf-videos to Internet Archive |
19:38
🔗
|
SketchCow |
Hurrah |
19:38
🔗
|
besc |
Does wget with WARC output save images too? |
19:38
🔗
|
SketchCow |
Yes |
19:39
🔗
|
besc |
Great, thanks. Haven't found a simple to use WARC viewer yet (without dependency on Python) |
19:39
🔗
|
SketchCow |
It's cool. |
19:42
🔗
|
SketchCow |
http://www3.alcatel-lucent.com/bstj/vol36-1957/bstj-vol36-issue01.html is a good one |
19:53
🔗
|
SketchCow |
2 left |
20:06
🔗
|
SketchCow |
Almost done! |
20:06
🔗
|
WiK |
humr, so far ive only downloaded 226 github projects today |
20:08
🔗
|
WiK |
looks like tuesday ill get the rasp pi's so i can make this project 100% own its own and off my man every day desktop |
20:14
🔗
|
SmileyG |
anyone grabbed this 400gb of files from wikileaks yet? |
20:14
🔗
|
* |
SmileyG giggles how this is turning into the dan brown story. |
20:18
🔗
|
SketchCow |
:Bell System Technical Journal, 39: 4 July 1960 pp 947-962. Synthesis of Driving-Point Impedances with Active RC Networks (Sandberg, I.W.) |
20:18
🔗
|
SketchCow |
I need to grab the wikileaks |
20:18
🔗
|
SketchCow |
Will do it. |
20:18
🔗
|
SketchCow |
Nobody waste time on it. |
20:19
🔗
|
SmileyG |
k. |
20:19
🔗
|
SmileyG |
Was going to offer tomorrow at work, shout if I can help in about.... 14 hours SketchCow |
20:21
🔗
|
SketchCow |
wlinsurance-20130815-A.aes256 62.6 / 3400.0 MB Rate: 0.0 / 1275.8 KB Uploaded: 0.0 MB [ 0%] 0d 0:44 [ R: 0.00] |
20:22
🔗
|
SketchCow |
Insurance File A, I will have in 44 seconds. |
20:22
🔗
|
SketchCow |
Wait, 13. |
20:22
🔗
|
godane |
there is a new insurance file? |
20:23
🔗
|
SketchCow |
Wait, no, 3 minutes. Anyway. |
20:23
🔗
|
SketchCow |
File B, I will have in less than 6 hours. |
20:24
🔗
|
godane |
i was just thinking it was release in like 2010 or something |
20:24
🔗
|
WiK |
man, so wish i could get fiber in my area |
20:24
🔗
|
WiK |
i only have 7200kbps down |
20:25
🔗
|
SketchCow |
OK! Bell System Technical Journal done. |
20:25
🔗
|
ersi |
yay |
20:25
🔗
|
ersi |
SmileyG: Tephra was doing it I think |
20:26
🔗
|
ersi |
godane: New insurance files |
20:26
🔗
|
SketchCow |
Insurance File A is now downloaded. |
20:47
🔗
|
SketchCow |
F T P . O S U O S L . O R G |
20:47
🔗
|
SketchCow |
Open Source Lab |
20:47
🔗
|
SketchCow |
Oregon State University |
20:47
🔗
|
SketchCow |
Unauthorized use is prohibited - violators will be prosecuted |
20:47
🔗
|
SketchCow |
.. |
20:47
🔗
|
SketchCow |
Welcome to our happy lab! Where you can PS STEP OUT OF LINE AND YOUR ASS IS ARRESTED |
20:48
🔗
|
WiK |
lol |
20:49
🔗
|
WiK |
do they define what 'stepping out of line' is? |
20:49
🔗
|
antomatic |
'unauthorised use'. Which I guess could mean anything up to and including "Did we authorise you to type those letters, in that order?" |
20:49
🔗
|
WiK |
exactly |
20:51
🔗
|
antomatic |
Apparently Parker Lewis *can* lose. |
20:51
🔗
|
WiK |
hahaah |
21:00
🔗
|
SketchCow |
Uploading Insurance File A. |
21:00
🔗
|
SketchCow |
I wonder what collection to shove it in. |
21:05
🔗
|
antomatic |
"Gee those were some nice servers we had once", possibly? :) |
21:05
🔗
|
SketchCow |
http://archive.org/details/wlinsurance-20130815-A.aes256 |
21:11
🔗
|
SketchCow |
Can someone please download http://goanimate.com/videos/0sg-lFuXxVW0 ? |
21:20
🔗
|
SketchCow |
Here they go, 1.4tb of MESS CHDs. |
21:29
🔗
|
WiK |
omf_: http://redteamers.com/blog/intel-links-from-classwork |
21:29
🔗
|
WiK |
check that out, my project is being 'taught' as a redteam recon source |
21:29
🔗
|
WiK |
how sweet is that :) |
21:38
🔗
|
balrog |
insurance file A is small |
21:38
🔗
|
balrog |
B and especially C, now those are large |
21:39
🔗
|
SketchCow |
Yes. |
21:39
🔗
|
SketchCow |
As soon as I clear out some disk space, I get C. |
21:40
🔗
|
SketchCow |
B is coming as we speak. |
21:52
🔗
|
dashcloud |
WiK: that's pretty cool |
21:56
🔗
|
godane |
does anyone have a invite to cinemageddon? |
21:57
🔗
|
godane |
also i'm grabing the original wendy's grill skill's 1989 video |
23:38
🔗
|
Famicoman |
godane yes |
23:38
🔗
|
Famicoman |
pm me your email when you get a chance |
23:39
🔗
|
Famicoman |
but, they do have open registration like every hour |
23:39
🔗
|
Famicoman |
so if I don't get to you in time you can just keep pinging the registration page |
23:47
🔗
|
godane |
hey mistrym |
23:47
🔗
|
godane |
did you get word from WGBH Archives yet? |