Time |
Nickname |
Message |
00:47
π
|
pft |
so, weird question. is it bad form for me to use wget to WARC stuff from the wayback machine? |
00:49
π
|
pft |
i'd like to archive previous employer web sites for when the domains inevitably expire and a new domain owner potentially robots.txt's me out of stuff i worked on |
00:57
π
|
SketchCow |
No. |
00:57
π
|
SketchCow |
Save everything however you want. |
00:57
π
|
pft |
ok |
00:57
π
|
pft |
thanks :) |
00:57
π
|
SketchCow |
Archive Team exists to give people choices, instead of all those who would take them away. |
01:03
π
|
pft |
hmm even though wayback's robots.txt looks like it disallows it? |
01:06
π
|
instence |
whats the best way to strip multi-line blocks of ad code from HTML in a post process? |
01:06
π
|
instence |
is there a way to set a text file as matching equivalent in a perl find/replace via cmd line? |
01:14
π
|
joepie91 |
I'd imagine sed would come in handy here? |
01:14
π
|
joepie91 |
not that I know how to use it, but still |
01:15
π
|
joepie91 |
:P |
01:25
π
|
instence |
sed is great for simpler 1 line replaces |
01:25
π
|
instence |
sed doesn't support lookahead/lookbehind |
01:25
π
|
instence |
and its regex engine is limited |
01:25
π
|
instence |
though there are more advanced versions of sed available that can do more, perl is much better for dealing with more complex multi-line regex's |
01:43
π
|
mistym |
Yeah, POSIX sed has extended regex support (with -E) and GNU sed has a variety of extensions, but it's not very portable. Might as well go straight to something more advanced at that point. |
01:48
π
|
mistym |
(Also I really don't like how gsed makes you pass --posix to get posix-compliant behaviour... when --posix is not a valid POSIX option and will make BSD sed barf) |
01:51
π
|
instence |
to force perl to parse multi lines in a cmdline type setup I had to add -0p |
01:51
π
|
instence |
which is SO not obvious |
01:51
π
|
instence |
had to find it on stackoverflow |
01:51
π
|
mistym |
Huh, that's not listed in man perl but is in perl --help. |
02:49
π
|
winr4r |
well, i'm late to the party, but mistym is wrong about having to care about portability with sed |
04:49
π
|
godane |
so full interviews of cbc spark for year 2007 is uploaded now |
04:55
π
|
godane |
good news everyone |
04:56
π
|
godane |
i found Brewster Kahle interview on spark |
04:56
π
|
godane |
the full interview |
04:56
π
|
godane |
this is the file url: http://thumbnails.cbc.ca/maven_legacy/thumbnails/13/481/bonussparkplus_20080909_16405_uploaded.mp3 |
05:24
π
|
instence |
joepie91: i managed to get multi-line replace working with sed |
05:24
π
|
instence |
I had to, since perl in cygwin likes to create .bak files for every file that it modifies |
05:26
π
|
omf_ |
not if you do it this way instence |
05:26
π
|
omf_ |
perl -pi -e 's/old/new/' |
05:26
π
|
omf_ |
inline file edit |
05:27
π
|
instence |
http://stackoverflow.com/questions/11074322/does-perls-i-with-no-argument-create-a-backup-file-on-cygwin |
05:29
π
|
omf_ |
wow subtle windows problems :( |
05:29
π
|
instence |
ye :/ |
05:29
π
|
instence |
yea |
05:31
π
|
instence |
so i'm using this: |
05:31
π
|
instence |
sed -r ':a;N;$!ba; s/FIND/REPLACE/g' file.html |
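A commented version of that sed idiom, as a sketch assuming GNU sed. The `<br>` pattern is only a stand-in for FIND/REPLACE.

```shell
# Pull the whole input into the pattern space, then substitute across lines.
#   :a     define a label named "a"
#   N      append the next input line (and a newline) to the pattern space
#   $!ba   if the current line is not the last one, branch back to "a"
# Once the loop ends the pattern space holds every line, so \n can be matched.
merge_double_breaks() {
  sed -e ':a' -e 'N' -e '$!ba' -e 's/<br>\n<br>/<br>/g'
}
```

Two caveats: the entire file is held in memory, and on a one-line input `N` finds no next line, so GNU sed prints the line and exits before the substitution runs.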
05:37
π
|
godane |
can anyone get this one to stream: http://www.cbc.ca/player/Radio/Spark/ID/2235141926/ |
05:46
π
|
winr4r |
godane: nope |
06:10
π
|
omf_ |
Anyone been to pumpcon |
06:14
π
|
SketchCow |
I have. |
06:18
π
|
godane |
so funny thing about spark player |
06:18
π
|
godane |
when you click on 2008 year in full episodes |
06:18
π
|
godane |
only one episode is there |
06:19
π
|
godane |
cbc spark is somehow trying to hide or delete |
06:19
π
|
godane |
stuff |
07:28
π
|
instence |
1 way to drive yourself crazy: forgetting to save .sh file as unix line formatting in a windows environment to execute in cygwin |
07:29
π
|
omf_ |
I cannot remember does cygwin come with dos2unix? |
07:34
π
|
GLaDOS |
You know, I've never liked that price increase they put on Australians.. http://i.imgur.com/JmmjzpZ.png |
07:55
π
|
instence |
i would have to check |
07:55
π
|
instence |
there are ways to convert the linefeeds to unix from windows using sed and other apps |
07:56
π
|
instence |
though i just had to remember to use unix EOL when saving my .sh file from notepad++ |
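A minimal sketch of the line-ending fix mentioned above, using `tr` instead of sed so it does not depend on which sed is installed. The function names are made up for illustration.

```shell
# Delete every carriage return so CRLF line endings become plain LF.
# Note: this removes all \r bytes, not only the ones at line ends.
crlf_to_lf() {
  tr -d '\r'
}

# Succeed if the file named in $1 contains at least one carriage return.
has_cr() {
  LC_ALL=C grep -q "$(printf '\r')" "$1"
}
```

Usage: `has_cr script.sh && crlf_to_lf < script.sh > script.unix.sh`. Write to a new file and rename it afterwards; redirecting output onto the input file would truncate it.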
08:48
π
|
godane |
starting to upload g4 e3 2012 full episode videos |
09:30
π
|
Tephra |
anyone have any twitter user archiving scripts? |
09:40
π
|
ivan` |
Tephra: I just load Twitter's JS and use ctrl-s with Mozilla Archive Format |
09:40
π
|
ivan` |
I'm sure there's some archiver on github somewhere |
10:09
π
|
Tephra |
fantastic, the paper: "The Continued Movement for Open Access to Peer-Reviewed Literature" --- paywalled! |
13:03
π
|
joepie91 |
instence: yay! |
14:00
π
|
GLaDOS |
SketchCow: am I right in assuming that http://archive.org/details/martinmanleylifeanddeath.com-20130816 will be ingested into the Wayback eventually? |
14:01
π
|
GLaDOS |
Or, hell, anyone here? |
14:02
π
|
BlueMax |
hello. |
14:02
π
|
GLaDOS |
Hi. |
14:05
π
|
SmileyG |
o/ |
14:06
π
|
SmileyG |
GLaDOS: i believe all warcs are, yes. |
14:06
π
|
ersi |
All warcs that are in the right collection are |
14:06
π
|
GLaDOS |
That collection being web? |
14:06
π
|
ersi |
IIRC they should be in the "Web Crawl" collection |
14:06
π
|
ersi |
GLaDOS: Mediatype != collection |
14:07
π
|
GLaDOS |
Ah. |
14:07
π
|
ersi |
Web Crawls > Archive Team > The Archive Team Just In Time Grabs |
14:07
π
|
SmileyG |
ahhh |
14:07
π
|
ersi |
It's in a sub-sub collection of Web Crawls. So yes, it'll be available in Wayback |
14:08
π
|
ersi |
Disclaimer: I could be wrong about this, but if I recall correctly.. this should be the case |
14:08
π
|
ersi |
teehee~ I've uploaded 21 videos now |
14:52
π
|
antomatic |
I guess this needs to be moved, then: https://archive.org/details/Uponfurtherreview.blog.comPanicgrab20130815.warc |
14:52
π
|
antomatic |
(tries to log in) |
14:52
π
|
antomatic |
(fails) |
14:52
π
|
antomatic |
(has forgotten password) |
14:52
π
|
antomatic |
(dah!) |
14:53
π
|
antomatic |
Magical goblins change my passwords without me knowing. bleeh. :) |
15:40
π
|
joepie91 |
http://gratisoptehalen.nl/advertentie.php?id=241847 |
15:40
π
|
joepie91 |
hmm |
15:40
π
|
joepie91 |
bunch of old pieces of software, some Dutch some English |
15:41
π
|
joepie91 |
(it's free, just shipping costs) |
15:41
π
|
joepie91 |
worth getting? |
15:41
π
|
joepie91 |
(and archiving) |
15:41
π
|
ersi |
probably :) |
15:42
π
|
joepie91 |
really should keep more of an eye on that stuff |
15:42
π
|
joepie91 |
that site in particular |
15:42
π
|
joepie91 |
lots of this old stuff coming by |
15:42
π
|
omf_ |
joepie91, what up :o) |
15:43
π
|
joepie91 |
ohai |
15:43
π
|
joepie91 |
about to take out the trash |
16:20
π
|
SketchCow |
GLaDOS: Of course it will. |
16:21
π
|
SketchCow |
All archiveteam 'web' type objects are ingested roughly every two weeks. |
16:21
π
|
SketchCow |
ersi is wrong. It's items set 'web'. |
16:21
π
|
SketchCow |
That's why not everyone can set that. |
16:22
π
|
SketchCow |
https://archive.org/details/Uponfurtherreview.blog.comPanicgrab20130815.warc now fixed. |
16:24
π
|
godane |
so i got g4 e3 2012 stuff uploaded now |
16:24
π
|
godane |
and checked in |
16:24
π
|
antomatic |
thanks so much, Sketchcow. |
16:27
π
|
ersi |
ah, alright |
16:28
π
|
ersi |
SketchCow: So the 'Web Crawls' collection-connection has nothing to do with it? Just the mediatype? |
16:35
π
|
SketchCow |
Right. |
16:35
π
|
SketchCow |
Well, to be MORE specific... |
16:36
π
|
SketchCow |
The mediatype is the definer. "web". There's attempts in terms of placing items in collections to ensure that web crawls are bunched together for a pure organizational effort, but they're not affecting The Programs. |
16:36
π
|
ersi |
Ah, alright. |
16:36
π
|
SketchCow |
The Programs are crawling through the archive.org items, finding ones with a "web" mediatype, and if they're new or changed, ingesting them. |
16:37
π
|
SketchCow |
But The Programs are NOT going to things with a "texts" mediatype, going "well, hmmm, it has a warc.gz, let's add it too". |
16:37
π
|
SketchCow |
The fact this goes on at all to the level it does is because of me at the archive. |
16:37
π
|
SketchCow |
It used to be something not quite done. |
16:37
π
|
ersi |
This is way better than nothing :) |
16:37
π
|
SketchCow |
Now we're doing it so much we're contributing major blocks of the internet into the wayback machine. |
16:38
π
|
ersi |
I'm just really curious how things work at Internet Archive.. Unfortunately, details are really.. nonexistent for us mere mortals :) |
16:38
π
|
SketchCow |
Yeah, again, that's because of how they've been set up for years. |
16:38
π
|
SketchCow |
It was rather painful for them, the amount of opening I'm forcing. |
16:38
π
|
* |
ersi nods understandingly |
16:39
π
|
SketchCow |
One or two devs and I did not get along over it. |
16:39
π
|
SketchCow |
And there is still some fear about it. |
16:39
π
|
ersi |
I'm sure more than just us, would be *really* interested in reading about a lot of IA things. I'm sure that could be used to drive donations as well. |
16:39
π
|
joepie91 |
agreed with ersi there |
16:40
π
|
joepie91 |
the whole IA setup fascinates me tbh |
16:40
π
|
SketchCow |
But! You bring in the crazy open-access insane activist and you get what you get. |
16:40
π
|
joepie91 |
(which is why I quite enjoyed your recent post, SketchCow) |
16:40
π
|
ersi |
It could also drive volunteer effort in contributing other efforts. Like coding on wayback and tools |
16:40
π
|
joepie91 |
heh |
16:40
π
|
* |
ersi shrugs |
16:40
π
|
ersi |
I forgot about those |
16:40
π
|
joepie91 |
"well, that's just part of the package, guys!" |
16:40
π
|
SketchCow |
So, I'm walking through a lot of this. |
16:41
π
|
SketchCow |
It took months to get subscriptions working in the system. |
16:41
π
|
SketchCow |
Just slow will to radical change. |
16:41
π
|
ersi |
I signed up, by the way. |
16:41
π
|
ersi |
Totally worth while |
16:41
π
|
joepie91 |
uploaded: https://archive.org/details/KeygenMusicPack-July2013 |
16:41
π
|
SketchCow |
So give me time, I'm spending a lot of effort to fix a lot of things. |
16:41
π
|
joepie91 |
... in hindsight, would it have been better to upload it as a .zip instead of the original .7z, so that you can browse it? |
16:41
π
|
SketchCow |
http://archive.org/details/software - that didn't exist like that a mere year ago. |
16:41
π
|
ersi |
Of course SketchCow. I'm just.. excited about things. :) |
16:41
π
|
SketchCow |
Now it's fuckin'..... it's the bomb |
16:42
π
|
ersi |
da bomb |
16:42
π
|
SketchCow |
Also, I'm hand-cleaning things today. |
16:43
π
|
SketchCow |
For example, I have a small pile of Bell System Technical Journal papers to go in. |
16:43
π
|
SketchCow |
I'm REALLY trying to murder this 11tb backlog on my archive.org machine. |
16:43
π
|
ersi |
Hahah, niiice |
16:44
π
|
SketchCow |
Just 25 more journal papers to go in, but I can't use the script, it's all by hand. |
16:44
π
|
SketchCow |
http://archive.org/details/bstj-archives |
16:44
π
|
SketchCow |
But see? There's 4,337 items, just sitting there. |
16:44
π
|
ersi |
Yeah, I feel ya'. I'm doing the DebConf12-videos by hand as well. |
16:44
π
|
joepie91 |
mm, found a bunch of shady links, wat do? send email? |
16:45
π
|
ersi |
info@archive.org I think |
16:45
π
|
SketchCow |
When I turn to http://archive.org/details/hackercons it will be glorious. |
16:45
π
|
SketchCow |
Yes, give Jeff stuff. |
16:45
π
|
SketchCow |
(info@archive.org) |
16:45
π
|
SketchCow |
Jeff is a goddamned master. |
16:47
π
|
joepie91 |
email sent |
16:48
π
|
joepie91 |
guessing malware |
16:48
π
|
joepie91 |
spammy descriptions, files that are too small to contain the listed software, and a seemingly auto-generated e-mail address for the uploader |
16:48
π
|
joepie91 |
different for each |
16:48
π
|
joepie91 |
despite having highly similar description and title formats |
16:48
π
|
* |
joepie91 puts the popped up red flags back in the bin |
17:05
π
|
godane |
SketchCow: the title on this item should be fixed: http://archive.org/details/MmprMagazine-Fall1994 |
17:11
π
|
joepie91 |
http://sebsauvage.net/paste/?bca6cef7a70dfb9c#JqGn/10j/zVeN8kyLK0I2w4OwSBGPH2ZCabTbPc6qpY= |
17:22
π
|
godane |
anyone willing to mirror my old isos and scripts? |
17:22
π
|
godane |
here is the link: http://arch-live.isawsome.net/ |
17:22
π
|
godane |
i ask cause i think its too big for me to mirror |
17:24
π
|
SketchCow |
godane: fixed. |
17:24
π
|
SketchCow |
joepie91: Bear in mind that we do notice things like that and do cleaning runs. Many. |
17:25
π
|
omf_ |
SketchCow, is there an abuse only email address or is it just info@ |
17:35
π
|
omf_ |
The media makes me want to take a drone strike out on them http://www.huffingtonpost.com/2013/08/17/michael-grunwald-julian-assange_n_3773981.html |
17:35
π
|
omf_ |
(╯°□°)╯︵ ǝuızɐƃɐW ǝɯı⊥ |
17:42
π
|
joepie91 |
omf_: that guy certainly managed to get himself very near the top of my "people I would never get along with" list |
17:47
π
|
godane |
uploaded: http://archive.org/details/G4.Comic-Con.2012.Live.HDTV.x264-Eclipse |
17:47
π
|
godane |
you now got the g4 specials for 2012 |
17:51
π
|
yipdw |
joepie91: yeah, Grunwald's a pussy |
17:51
π
|
yipdw |
well |
17:51
π
|
yipdw |
actually, no |
17:52
π
|
yipdw |
that's a bad term |
17:52
π
|
yipdw |
it implies females are weak |
17:52
π
|
yipdw |
he just sucks |
17:52
π
|
SketchCow |
Grunwald's just an idiot |
17:52
π
|
SketchCow |
And he forgot that his twitter account is a professional representation. |
17:52
π
|
SketchCow |
I promise you, Journalists say terrible things all the time. |
17:52
π
|
SketchCow |
Just not usually into an open mike. |
17:52
π
|
joepie91 |
SketchCow: I'm not sure he so much 'forgot' as just didn't give a shit... |
17:52
π
|
SketchCow |
That was an open mike. |
17:52
π
|
SketchCow |
No, again, journalists are awful, dude. |
17:53
π
|
SketchCow |
It's just this one went awful for no reason |
17:53
π
|
joepie91 |
oh, I know that a lot of them are, I've dealt with quite a few of them, and know a few personally... |
17:53
π
|
joepie91 |
but yes |
17:53
π
|
joepie91 |
most of them know how far to publicize their thoughts |
17:53
π
|
joepie91 |
and where to stop |
17:53
π
|
joepie91 |
Grunwald apparently did not |
17:56
π
|
SketchCow |
Yeah, made a mistake. |
17:57
π
|
omf_ |
http://i.imgur.com/Ha2wD19.gif |
18:00
π
|
joepie91 |
jetpack cat |
18:12
π
|
ersi |
SketchCow: Could you create a collection for DefCon12 and these items: http://burl.se/3ce ? |
18:14
π
|
underscor |
<SketchCow> It was rather painful for them, the amount of opening I'm forcing. |
18:14
π
|
ersi |
SketchCow: I meant DebConf12 |
18:14
π
|
ersi |
underscor: How was Defcon? :) |
18:14
π
|
underscor |
I'm just imagining jason with elbow length rubber gloves and a crowbar |
18:14
π
|
ersi |
Mr sound guy |
18:16
π
|
underscor |
pretty fun! |
18:16
π
|
underscor |
a little bit of a blur |
18:16
π
|
underscor |
but I thoroughly enjoyed it |
18:16
π
|
ersi |
nice ^_^ yeah, that means it was fun though |
18:16
π
|
ersi |
I noticed you by name in the credits |
18:16
π
|
ersi |
of the documentary |
18:16
π
|
underscor |
aside from having a mental breakdown on, uh, saturday night I think |
18:16
π
|
underscor |
but I needed that |
18:16
π
|
underscor |
doing a lot better now |
18:17
π
|
underscor |
hehee, yay ^^ |
18:17
π
|
ersi |
sounds.. bad. Hope it helped in the long run though |
18:17
π
|
underscor |
it was truly awful |
18:17
π
|
ersi |
You guys should come over for some Europe-action sometime |
18:17
π
|
underscor |
I'd been spiraling into depression for the prior month |
18:18
π
|
underscor |
That's when it came to a head (*cough* A drink didn't help with that *cough*) |
18:18
π
|
* |
omf_ sends hugs underscor's way |
18:18
π
|
underscor |
But that was also the turning point to getting out of it |
18:18
π
|
ersi |
naw, drinks ain't helpin' against that kind of thing |
18:18
π
|
underscor |
and I am doing well now |
18:19
π
|
underscor |
Crying for a few hours in the corner of an acquaintance's hotel room in a casino of which I am still unsure of the name of |
18:19
π
|
underscor |
can be therapeutic |
18:19
π
|
underscor |
I guess |
18:19
π
|
underscor |
lol |
18:19
π
|
ersi |
Las Vegas, baby. |
18:20
π
|
SketchCow |
http://i.imgur.com/0C9zB0k.jpg |
18:20
π
|
ersi |
hahah |
18:20
π
|
ersi |
amen! |
18:20
π
|
underscor |
SketchCow: hah, I thought of you when I read that earlier |
18:24
π
|
SketchCow |
Yeah, great emo weekend for underscor |
18:24
π
|
SketchCow |
Leaving Las Vegas Jr. |
18:25
π
|
underscor |
It was fun! |
18:25
π
|
underscor |
I turned the defcon knob up to 11 though |
18:25
π
|
underscor |
from like, 0.35 |
18:25
π
|
SketchCow |
Well, growing up is tough, especially when you have a patchy support system like before you met me. |
18:25
π
|
underscor |
Yeah <3 |
18:25
π
|
SketchCow |
You actually did defcon knob to 6-7 before because you kept going out at night after "work". |
18:25
π
|
SketchCow |
Which was stuuuuuuuupid |
18:25
π
|
SketchCow |
But that's what you do at 19 |
18:26
π
|
SketchCow |
Stuuuuuuuuuuppppid |
18:26
π
|
* |
underscor sheepish grin |
18:26
π
|
SketchCow |
I have audio recording of you plotting to skip out |
18:26
π
|
SketchCow |
it is adorbs. |
18:26
π
|
underscor |
hahahaha |
18:26
π
|
underscor |
I remember that |
18:27
π
|
godane |
so looks like Brewster Kahle may have archived techtv |
18:27
π
|
underscor |
I remember leaving the tripod on the bed |
18:27
π
|
underscor |
and that was the only record of my existance |
18:27
π
|
SketchCow |
Personally? I doubt that |
18:27
π
|
godane |
only cause he said the tv archive started in 2000 |
18:27
π
|
underscor |
existence* |
18:27
π
|
SketchCow |
Oh, I see what you mean. |
18:27
π
|
SketchCow |
It is likely we have TechTV and a bunch of other material on the TV archive. |
18:27
π
|
SketchCow |
Doesn't mean we don't need you saving/classifying |
18:28
π
|
SketchCow |
For my own bit, I'm still trying to knock FOS down from 11gb of data. |
18:28
π
|
godane |
i know |
18:28
π
|
ersi |
:D |
18:28
π
|
ersi |
11GB? Surely you meant either TB or PB |
18:28
π
|
godane |
i at least add keywords like the people in the shows |
18:29
π
|
SketchCow |
11tb |
18:29
π
|
godane |
its 11TB |
18:29
π
|
SketchCow |
Anyway, back to some REALLY tedious tasks that I have been putting off for months. |
18:30
π
|
SketchCow |
Underscor, let me know when you're available to do work. |
18:30
π
|
SketchCow |
Also, don't let depression get the best of you next time. |
18:30
π
|
SketchCow |
You're going to have near misses a few more times in the next 10 years. |
18:30
π
|
godane |
good news is i may get all g4 web videos fully uploaded in 2 weeks |
18:30
π
|
SketchCow |
Don't be that guy. |
18:32
π
|
ersi |
There's plenty of people that care JFYI |
18:33
π
|
underscor |
I know |
18:33
π
|
* |
underscor sighs |
18:33
π
|
underscor |
Emotions suck |
18:33
π
|
underscor |
and so does life uncertainty |
18:33
π
|
underscor |
x3 |
18:33
π
|
ersi |
Life is just states of mind |
18:34
π
|
ersi |
IMO |
18:34
π
|
SketchCow |
http://emopotatoe.ytmnd.com/ |
18:35
π
|
underscor |
ooooh |
18:35
π
|
underscor |
what song is that? |
18:35
π
|
underscor |
I like it |
18:35
π
|
underscor |
That progression is orgasmically delicious |
18:36
π
|
ersi |
underscor: Simple Plan is the band |
18:36
π
|
ersi |
"how can this happen to me" is the song |
18:36
π
|
underscor |
sweet, thanks :D |
18:36
π
|
ersi |
np :) |
18:47
π
|
SketchCow |
Killing the BSTJ list |
18:47
π
|
SketchCow |
Going well |
18:47
π
|
SketchCow |
Blasting Lewis Black |
18:47
π
|
SketchCow |
the jason scott of comedians |
18:48
π
|
ersi |
:D |
18:48
π
|
ersi |
sounds great |
18:49
π
|
underscor |
hmm, is there a Jason Scott youtube playlist I can give my mom? |
18:50
π
|
SketchCow |
Why would you do that |
18:50
π
|
SketchCow |
She'll call the cops |
18:50
π
|
SketchCow |
She'll call cops that don't even handle domestic situations |
18:50
π
|
godane |
i agree |
18:50
π
|
SketchCow |
http://www.youtube.com/watch?v=ELji4-TogMI |
18:51
π
|
godane |
SketchCow: i uploaded like 3 collections in the last 4 days |
18:52
π
|
godane |
systm, foundation, and giantbomb podcast |
18:54
π
|
underscor |
SketchCow: she wanted to know what you talk about |
18:55
π
|
ersi |
Hah, oh wow - yeah, wget surely doesn't handle filename encodings right |
18:55
π
|
SketchCow |
You. Tell her I talk about you. |
18:56
π
|
ersi |
(C81) (?%90%8CÀºº??%8C) [Lv.X+ (?%9F%9A?%9C?N')] ?%83%95?%83??%82??%82??%83??%83%83?%82??%83? (?%9C??%9D??%97???%98).zip |
18:56
π
|
omf_ |
underscor, the closest thing I know of http://www.archiveteam.org/index.php?title=Talks |
18:56
π
|
ersi |
That's a good filename |
18:56
π
|
underscor |
SketchCow: :D |
18:56
π
|
SketchCow |
http://ascii.textfiles.com/speaking have a ball |
19:03
π
|
SketchCow |
Incredibly boring work continuing |
19:03
π
|
pft |
hurrr yeah |
19:04
π
|
pft |
trying to download stuff from the wayback machine with wget sucks |
19:04
π
|
ersi |
Why not just grab the .warc and then extract from it? |
19:04
π
|
pft |
how do i grab a warc for a site off wayback? |
19:04
π
|
pft |
the site |
19:04
π
|
pft |
er |
19:05
π
|
ersi |
What site is it? |
19:05
π
|
pft |
the site's down and i'm afraid it won't be coming back up, i want to keep a local copy |
19:05
π
|
pft |
it's a former employer, a site i worked on. i'd like to keep a copy |
19:05
π
|
ersi |
I meant URL :) |
19:05
π
|
pft |
if my former boss had warned me i'd have WARC'd all the sites that he still had up |
19:05
π
|
pft |
http://www.pygmy.com/ is one of a few |
19:05
π
|
ersi |
aight, I'll see if I can dig it out |
19:06
π
|
ersi |
then explain how I got there |
19:06
π
|
omf_ |
http://www.washingtonpost.com/blogs/the-switch/wp/2013/08/18/heres-what-you-find-when-you-scan-the-entire-internet-in-an-hour/ |
19:06
π
|
pft |
thanks <3 |
19:06
π
|
pft |
i'd love to know how to download warcs from wayback |
19:08
π
|
godane |
just watch the talk where jason was with his boss |
19:08
π
|
omf_ |
yeah that one is solid |
19:08
π
|
omf_ |
that was ROFLCon I think |
19:08
π
|
godane |
yes |
19:12
π
|
ersi |
argh, I know I've gotten to a Wayback Machine .WARC somehow |
19:12
π
|
pft |
yeah, i sure couldn't figure out how |
19:13
π
|
pft |
i assumed it wasn't permitted |
19:13
π
|
ersi |
A lot of the data is available in the "Web Crawls" collection (public) |
19:13
π
|
pft |
hmm ok |
19:15
π
|
ersi |
http://web-beta.archive.org/web/*/pygmy.com* |
19:15
π
|
ersi |
That'll be a bit easier, still not WARCs though |
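Short of getting the original WARCs, one workaround is to list a site's captures and fetch each one as the unmodified original. This is a sketch under two assumptions that are not from the chat: the Wayback CDX query interface with `fl=timestamp,original`, and the `id_` URL modifier that returns a capture without the Wayback rewriting. The function name is made up.

```shell
# Turn "timestamp original-url" lines (one capture per line) into replay URLs
# that request the capture as originally stored.
cdx_to_raw_urls() {
  while read -r ts url; do
    if [ -n "$ts" ] && [ -n "$url" ]; then
      printf 'https://web.archive.org/web/%sid_/%s\n' "$ts" "$url"
    fi
  done
}

# Network usage (not executed here):
#   curl -s 'http://web.archive.org/cdx/search/cdx?url=pygmy.com&matchType=domain&fl=timestamp,original' \
#     | cdx_to_raw_urls > urls.txt
#   wget --warc-file=pygmy -i urls.txt
```

The resulting WARC records your own fetches from the Wayback Machine, not the crawl that first captured the site, so it is a local copy and not a substitute for the source WARCs.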
19:16
π
|
pft |
yeah, that is an improvement |
19:17
π
|
SketchCow |
Arcing of Electrical Contacts in Telephone Switching Systems: Part IV - Mechanism of the Initiation of the Short Arc |
19:17
π
|
SketchCow |
It gets better |
19:18
π
|
* |
ersi nods |
19:22
π
|
pft |
wow yeah, i see a lot of the crawl data now but it's not obvious how to find which crawl the site is in |
19:25
π
|
SketchCow |
Adding Bell System Technical Journal articles as well as Manga. |
19:33
π
|
SketchCow |
Bell System Technical Journal, 35: 1 January 1956 pp 179-202. Statistical Techniques for Reducing the Experiment Time in Reliability Studies (Sobel, Milton) |
19:46
π
|
joepie91 |
http://www.theguardian.com/commentisfree/2013/aug/18/david-miranda-detained-uk-nsa?CMP=twt_gu |
20:17
π
|
godane |
so i got skyrim for my birthday |
20:17
π
|
godane |
it is the legendary edition |
20:18
π
|
godane |
i told the guy at gamestop that i hate the download only content cause it will be lost in 5 to 10 years |
20:19
π
|
godane |
cause it will not be on a game disc for retro gamers |
20:19
π
|
godane |
and archivers like us |
20:30
π
|
SmileyG |
ooo got invited to torrentleach. |
20:52
π
|
SketchCow |
root@teamarchive0:/0/PLEASUREDOME/MESS 0.149 Software List CHDs# ls |
20:52
π
|
SketchCow |
MESS_0.149_CHD_3do_m2 MESS_0.149_CHD_cdtv MESS_0.149_CHD_megacdj MESS_0.149_CHD_pippin MESS_0.149_CHD_segacd |
20:52
π
|
SketchCow |
Now begins the fun. |
20:52
π
|
SketchCow |
MESS_0.149_CHD_cd32 MESS_0.149_CHD_mac_hdd MESS_0.149_CHD_neocd MESS_0.149_CHD_psx MESS_0.149_CHD_vsmile_cd |
20:52
π
|
SketchCow |
MESS_0.149_CHD_cdi MESS_0.149_CHD_megacd MESS_0.149_CHD_pcecd MESS_0.149_CHD_saturn |
20:53
π
|
SmileyG |
godane: remind me of that torrent site? |
20:53
π
|
SmileyG |
squid? |
20:53
π
|
omf_ |
myspleen |
20:53
π
|
SmileyG |
ty |
20:53
π
|
SmileyG |
my mind went blank |
20:53
π
|
SmileyG |
I have infinite upload xD |
21:04
π
|
ersi |
hah, torrentleach |
21:33
π
|
godane |
i'm uploading how to beat video games |
21:33
π
|
godane |
its from 1982 |
21:35
π
|
godane |
also i may get the original encode version of ancient prophecies |
21:35
π
|
godane |
3 |
22:07
π
|
joepie91 |
looks like anon is getting into the archiving business? |
22:07
π
|
joepie91 |
http://www.slate.com/blogs/future_tense/2013/08/18/martin_manley_s_sister_asks_yahoo_to_put_his_suicide_website_back_up.html |
22:12
π
|
ersi |
but.. doesn't anon forget? |
22:15
π
|
joepie91 |
lol |
22:15
π
|
joepie91 |
also, some interesting info tied to that mirror |
22:19
π
|
ersi |
http://i.imgur.com/bLOcshV.jpg |
22:19
π
|
ersi |
http://i.imgur.com/3FENs7f.jpg |
22:28
π
|
balrog |
ha, yahoo took it down. |
22:28
π
|
balrog |
I'm not surprised one bit. |
22:53
π
|
balrog |
the mirrors appear to be missing this page http://webcache.googleusercontent.com/search?q=cache:SY4Clycsfk0J:martinmanleylifeanddeath.com/june_11_2012+&cd=1&hl=en&ct=clnk&gl=us |
22:59
π
|
balrog |
http://archive.is/1rVuQ |
23:06
π
|
ivan` |
scribd has changed their post-login download page to make it look like you must pay to download something |
23:06
π
|
ivan` |
but if you look at the very bottom there's a tiny link to go to the tit-for-tat upload page |
23:06
π
|
joepie91 |
scribd is so scummy |
23:06
π
|
joepie91 |
in more than one way |
23:16
π
|
SketchCow |
http://www.hackertyper.com/ |
23:16
π
|
SketchCow |
Put that in |
23:16
π
|
SketchCow |
Hit F11 |
23:16
π
|
SketchCow |
Type madly |
23:16
π
|
SketchCow |
INSTANT STREET CRED |
23:29
π
|
godane |
g4tv.com-video54924: PaxTest: http://archive.org/details/g4tv.com-video54924 |
23:29
π
|
godane |
got to love the test videos of g4 |
23:32
π
|
godane |
it gets good about 17 mins in |
23:38
π
|
godane |
uploaded: http://archive.org/details/How_To_Beat_Home_Video_Games_-_Vol.1_The_Best_Games_Vestron_1982 |
23:42
π
|
godane |
uploaded: http://archive.org/details/How_To_Beat_Home_Video_Games_-_Vol.2_The_Hot_New_Games_Vestron_1982 |
23:45
π
|
godane |
uploaded: http://archive.org/details/How_To_Beat_Home_Video_Games_-_Vol.3_Arcade_Quality_for_The_Home_Vestron_1982 |
23:51
π
|
SketchCow |
http://archive.org/movies/thumbnails.php?identifier=How_To_Beat_Home_Video_Games_-_Vol.1_The_Best_Games_Vestron_1982 |
23:59
π
|
godane |
uploaded: http://archive.org/details/Wendys.Grill.Skills.1989.Other.Xvid-CG |