Time |
Nickname |
Message |
00:14
๐
|
Dud1 |
I am tryin to convert รยก to a, but trying to find and replace รยก with \xc3 doesn't work. |
00:19
๐
|
Dud1 |
I can get รยญ replaced by replacing \xed, but รยก won't work. |
00:22
๐
|
DFJustin |
if it's utf-8 you may have to do \xc3\xa1 |
00:34
๐
|
Dud1 |
That didn't work. |
00:37
๐
|
godane |
something very interesting: http://mrtg.cbsig.net/rrd/html/ |
00:37
๐
|
godane |
we now have traffic of cbsnews.com videos |
00:47
๐
|
xmc |
cool |
02:11
๐
|
godane |
var old_date = "20020225";/*Any video before this date will display legacy real video clips: 20, 80 speeds*/ var cut_date = "20031120";/*Any video equal to or greater than this date will get windows media files*/ |
02:11
๐
|
godane |
thats the reason way every before 20031120 can't be found |
02:30
๐
|
DFJustin |
http://imgur.com/a/PETBA |
02:49
๐
|
godane |
so looks like the old real media files on cbsnews disappeared in fall of 2005 i think |
02:50
๐
|
godane |
in early 2005 it wayback machine could get them |
03:12
๐
|
mistym |
The English Language's longest work of literature: Smash Bros fanfiction (https://www.fanfiction.net/s/4112682/) |
03:12
๐
|
mistym |
3,592,814 words in 209 chapters |
03:27
๐
|
dashcloud |
someone cataloged every occurence of computers showing up in Law & Order: http://www.theverge.com/culture/2014/2/3/5373888/machinery-of-justice-20-years-of-computers-on-law-order |
04:33
๐
|
xmc |
dashcloud: more obsessive cataloguing: http://youtu.be/PIGxMENwq1k |
06:35
๐
|
godane |
SketchCow: now it starts: https://archive.org/details/cbsnews.com-video-2003-11-20 |
06:35
๐
|
godane |
i'm doing it this way to keep it neat |
06:43
๐
|
godane |
they most have started the online edition of cbsnews at the very beginning of 2005 it looks like |
06:46
๐
|
arkiver |
I need some help here |
06:47
๐
|
arkiver |
apparently the warc's created by the program https://github.com/odie5533/WarcMiddleware are not well gzipped |
06:47
๐
|
arkiver |
There should be a quick way to fix some of the code the make the warc's work in the wayback machine |
06:48
๐
|
arkiver |
but I'm not experienced with coding, so I don't know how to fix the issue |
06:48
๐
|
chfoo |
specifically the requests should *not* request gzip encoded content |
06:48
๐
|
arkiver |
could someone please take a look at the code and try to find out what needs to be changed? |
06:48
๐
|
arkiver |
I would be very happy about that |
06:49
๐
|
arkiver |
and then I can continue the my opera download |
06:56
๐
|
chfoo |
there should be some sort of magical config in scrapy.cfg or crawltest/settings.py to disable it |
06:57
๐
|
chfoo |
i might be "COMPRESSION_ENABLED = False" |
06:58
๐
|
DFJustin |
https://i.imgur.com/vBgqBBV.jpg |
07:00
๐
|
arkiver |
chfoo: yes, hopefully someone can find out what's wrong with script and how to turn it off, the GZip |
07:12
๐
|
arkiver |
chfoo! |
07:12
๐
|
arkiver |
This one? |
07:12
๐
|
arkiver |
self.use_gzip = True |
07:12
๐
|
arkiver |
:D |
07:15
๐
|
arkiver |
need to go to school... can't test it now |
07:15
๐
|
arkiver |
will do it when I'm back |
07:20
๐
|
godane |
i'm starting my big upload of ImagineFX dvds |
07:20
๐
|
godane |
its about 64gb |
12:11
๐
|
dashcloud |
xmc: looking at the video you passed along, I see this one in the sidebar: http://www.youtube.com/watch?v=ZPoqNeR3_UA Star Trek TNG Ambient Engine Noise (Idling for 24 hrs) - is that the longest Youtube video ever? |
12:23
๐
|
midas |
dashcloud: http://www.youtube.com/watch?v=YwtX4gW3-xU 36 hours long |
12:24
๐
|
midas |
it's so long there are ads during the vid |
13:30
๐
|
midas |
3.8T ftp.tu-chemnitz.de |
13:30
๐
|
midas |
5.0T ftp.uni-erlangen.de |
13:30
๐
|
midas |
671G ftp.uni-muenster.de |
13:30
๐
|
midas |
8.8G ftp.warwick.ac.uk |
13:30
๐
|
midas |
429G gatekeeper.dec.com |
13:32
๐
|
midas |
still not done... |
14:11
๐
|
GLaDOS |
ah shit, i forgot to renew archivingyoursh.it |
14:12
๐
|
GLaDOS |
ugh, i cant get into the account for it |
14:12
๐
|
GLaDOS |
ill do it tomorrow |
14:26
๐
|
midas |
ovh box? |
15:14
๐
|
joepie91 |
midas: it's about the domain |
15:14
๐
|
joepie91 |
not a server |
15:14
๐
|
joepie91 |
:P |
15:14
๐
|
joepie91 |
GLaDOS: should I remind you tomorrow? not sure how good you are at mental todo lists |
15:14
๐
|
joepie91 |
actually |
15:14
๐
|
joepie91 |
.in 1d GLaDOS: renew archivingyoursh.it |
15:14
๐
|
botpie91 |
joepie91: Okay, will remind on 05 Feb 2014 at 15:14Z |
15:14
๐
|
joepie91 |
:P |
15:14
๐
|
joepie91 |
nothing beats a bot, in the field of todo lists! |
15:20
๐
|
midas |
lol |
15:30
๐
|
ersi |
well, a netsplit would beat it |
15:50
๐
|
Smiley |
nothing beats graffiti :D |
15:57
๐
|
godane |
SketchCow: i found pdf transcripts of face the nation |
18:46
๐
|
chfoo |
not sure if this was mentioned already: http://chronicle.com/blogs/profhacker/why-not-spare-a-little-bandwidth-for-the-archive-team/55071 |
18:56
๐
|
joepie91 |
"It also throttles downloads of the material to limit overloading the dying service." |
18:56
๐
|
joepie91 |
haha |
19:13
๐
|
yipdw |
goddamnit why did I click the Disqus link |
19:19
๐
|
joepie91 |
yipdw: Disqus is rapidly becoming the IE of comments systems |
19:19
๐
|
joepie91 |
*accidentally click IE shortcut on taskbar* |
19:19
๐
|
joepie91 |
OH GOD NO |
19:19
๐
|
joepie91 |
*frantically tries to get out of IE starting* |
19:19
๐
|
joepie91 |
WHY DID I DO THAT |
19:19
๐
|
joepie91 |
etc. |
19:19
๐
|
yipdw |
yeah |
19:19
๐
|
yipdw |
luckily Ghostery usually blocks it |
19:20
๐
|
yipdw |
but in this case I had to get all curious |
19:20
๐
|
turnip |
Hooray for ghostery |
19:32
๐
|
Schbirid |
i somehow broke disqus on my system but i dont mind at all |
20:33
๐
|
ersi |
If a site uses Disqus, I won't comment on that site |
20:33
๐
|
ersi |
'cause it's disqusting |
20:34
๐
|
ersi |
Haha, who made the picture @ http://chronicle.com/blogs/profhacker/why-not-spare-a-little-bandwidth-for-the-archive-team/55071 |
20:34
๐
|
ersi |
it's awesome |
20:43
๐
|
DFJustin |
so what happened with jason's new york library smackdown or do we have to wait for the statute of limitations to run out first |
20:45
๐
|
midas |
what about government run archives? |
20:45
๐
|
midas |
should we trust that? |
20:46
๐
|
midas |
UK government, great example |
20:46
๐
|
midas |
http://www.nationalarchives.gov.uk/webarchive/ |
20:46
๐
|
midas |
or thailand, not really a archive but it's near a faultline and could be flooded |
20:47
๐
|
DFJustin |
with government the main concern is deliberate destruction and there is plenty of precedent on that |
20:57
๐
|
joepie91 |
man |
20:57
๐
|
joepie91 |
lowendtalk seems to be suffering from a bad case of the edits right now |
20:57
๐
|
joepie91 |
topic titles being edited by mods left, right and center |
20:57
๐
|
joepie91 |
to make them more politically correct |
20:57
๐
|
joepie91 |
(changing it into stuff like "misunderstanding blah blah - got refunded") |
20:59
๐
|
ersi |
guess it's hurting their relations with the crappy VPS providers |
20:59
๐
|
joepie91 |
even one where "allegations of" was prefixed |
21:03
๐
|
ersi |
they should add "political correct version:" as a prefix ;D |
21:26
๐
|
xmc |
ersi: I think someone here made it a long time ago |
21:27
๐
|
Smiley |
yup looks like it built ok :) |
21:40
๐
|
yipdw |
ersi: chfoo |
21:41
๐
|
yipdw |
at least according to archiveteam.org's change tracking |
21:41
๐
|
yipdw |
it's possible someone else did it |
21:47
๐
|
DFJustin |
oh https://archive.org/details/DigiBarn has started adding materials again |
21:57
๐
|
chfoo |
ersi: i made it. source file: https://github.com/chfoo/cloaked-octo-nemesis/blob/master/dev-docs/archiveteam_warrior_infrastructure.svg |
22:00
๐
|
ersi |
It's awesome. |
22:01
๐
|
SketchCow |
DFJustin: I gave them two days on account of snow |
22:01
๐
|
SketchCow |
Was also waiting to make sure Internet Archive didn't hit them first, I try to avoid muddying the pond |
22:32
๐
|
DFJustin |
aw |