#archiveteam-bs 2014-02-04,Tue

โ†‘back Search

Time Nickname Message
00:14 ๐Ÿ”— Dud1 I am tryin to convert รƒยก to a, but trying to find and replace รƒยก with \xc3 doesn't work.
00:19 ๐Ÿ”— Dud1 I can get รƒยญ replaced by replacing \xed, but รƒยก won't work.
00:22 ๐Ÿ”— DFJustin if it's utf-8 you may have to do \xc3\xa1
00:34 ๐Ÿ”— Dud1 That didn't work.
00:37 ๐Ÿ”— godane something very interesting: http://mrtg.cbsig.net/rrd/html/
00:37 ๐Ÿ”— godane we now have traffic of cbsnews.com videos
00:47 ๐Ÿ”— xmc cool
02:11 ๐Ÿ”— godane var old_date = "20020225";/*Any video before this date will display legacy real video clips: 20, 80 speeds*/ var cut_date = "20031120";/*Any video equal to or greater than this date will get windows media files*/
02:11 ๐Ÿ”— godane thats the reason way every before 20031120 can't be found
02:30 ๐Ÿ”— DFJustin http://imgur.com/a/PETBA
02:49 ๐Ÿ”— godane so looks like the old real media files on cbsnews disappeared in fall of 2005 i think
02:50 ๐Ÿ”— godane in early 2005 it wayback machine could get them
03:12 ๐Ÿ”— mistym The English Language's longest work of literature: Smash Bros fanfiction (https://www.fanfiction.net/s/4112682/)
03:12 ๐Ÿ”— mistym 3,592,814 words in 209 chapters
03:27 ๐Ÿ”— dashcloud someone cataloged every occurence of computers showing up in Law & Order: http://www.theverge.com/culture/2014/2/3/5373888/machinery-of-justice-20-years-of-computers-on-law-order
04:33 ๐Ÿ”— xmc dashcloud: more obsessive cataloguing: http://youtu.be/PIGxMENwq1k
06:35 ๐Ÿ”— godane SketchCow: now it starts: https://archive.org/details/cbsnews.com-video-2003-11-20
06:35 ๐Ÿ”— godane i'm doing it this way to keep it neat
06:43 ๐Ÿ”— godane they most have started the online edition of cbsnews at the very beginning of 2005 it looks like
06:46 ๐Ÿ”— arkiver I need some help here
06:47 ๐Ÿ”— arkiver apparently the warc's created by the program https://github.com/odie5533/WarcMiddleware are not well gzipped
06:47 ๐Ÿ”— arkiver There should be a quick way to fix some of the code the make the warc's work in the wayback machine
06:48 ๐Ÿ”— arkiver but I'm not experienced with coding, so I don't know how to fix the issue
06:48 ๐Ÿ”— chfoo specifically the requests should *not* request gzip encoded content
06:48 ๐Ÿ”— arkiver could someone please take a look at the code and try to find out what needs to be changed?
06:48 ๐Ÿ”— arkiver I would be very happy about that
06:49 ๐Ÿ”— arkiver and then I can continue the my opera download
06:56 ๐Ÿ”— chfoo there should be some sort of magical config in scrapy.cfg or crawltest/settings.py to disable it
06:57 ๐Ÿ”— chfoo i might be "COMPRESSION_ENABLED = False"
06:58 ๐Ÿ”— DFJustin https://i.imgur.com/vBgqBBV.jpg
07:00 ๐Ÿ”— arkiver chfoo: yes, hopefully someone can find out what's wrong with script and how to turn it off, the GZip
07:12 ๐Ÿ”— arkiver chfoo!
07:12 ๐Ÿ”— arkiver This one?
07:12 ๐Ÿ”— arkiver self.use_gzip = True
07:12 ๐Ÿ”— arkiver :D
07:15 ๐Ÿ”— arkiver need to go to school... can't test it now
07:15 ๐Ÿ”— arkiver will do it when I'm back
07:20 ๐Ÿ”— godane i'm starting my big upload of ImagineFX dvds
07:20 ๐Ÿ”— godane its about 64gb
12:11 ๐Ÿ”— dashcloud xmc: looking at the video you passed along, I see this one in the sidebar: http://www.youtube.com/watch?v=ZPoqNeR3_UA Star Trek TNG Ambient Engine Noise (Idling for 24 hrs) - is that the longest Youtube video ever?
12:23 ๐Ÿ”— midas dashcloud: http://www.youtube.com/watch?v=YwtX4gW3-xU 36 hours long
12:24 ๐Ÿ”— midas it's so long there are ads during the vid
13:30 ๐Ÿ”— midas 3.8T ftp.tu-chemnitz.de
13:30 ๐Ÿ”— midas 5.0T ftp.uni-erlangen.de
13:30 ๐Ÿ”— midas 671G ftp.uni-muenster.de
13:30 ๐Ÿ”— midas 8.8G ftp.warwick.ac.uk
13:30 ๐Ÿ”— midas 429G gatekeeper.dec.com
13:32 ๐Ÿ”— midas still not done...
14:11 ๐Ÿ”— GLaDOS ah shit, i forgot to renew archivingyoursh.it
14:12 ๐Ÿ”— GLaDOS ugh, i cant get into the account for it
14:12 ๐Ÿ”— GLaDOS ill do it tomorrow
14:26 ๐Ÿ”— midas ovh box?
15:14 ๐Ÿ”— joepie91 midas: it's about the domain
15:14 ๐Ÿ”— joepie91 not a server
15:14 ๐Ÿ”— joepie91 :P
15:14 ๐Ÿ”— joepie91 GLaDOS: should I remind you tomorrow? not sure how good you are at mental todo lists
15:14 ๐Ÿ”— joepie91 actually
15:14 ๐Ÿ”— joepie91 .in 1d GLaDOS: renew archivingyoursh.it
15:14 ๐Ÿ”— botpie91 joepie91: Okay, will remind on 05 Feb 2014 at 15:14Z
15:14 ๐Ÿ”— joepie91 :P
15:14 ๐Ÿ”— joepie91 nothing beats a bot, in the field of todo lists!
15:20 ๐Ÿ”— midas lol
15:30 ๐Ÿ”— ersi well, a netsplit would beat it
15:50 ๐Ÿ”— Smiley nothing beats graffiti :D
15:57 ๐Ÿ”— godane SketchCow: i found pdf transcripts of face the nation
18:46 ๐Ÿ”— chfoo not sure if this was mentioned already: http://chronicle.com/blogs/profhacker/why-not-spare-a-little-bandwidth-for-the-archive-team/55071
18:56 ๐Ÿ”— joepie91 "It also throttles downloads of the material to limit overloading the dying service."
18:56 ๐Ÿ”— joepie91 haha
19:13 ๐Ÿ”— yipdw goddamnit why did I click the Disqus link
19:19 ๐Ÿ”— joepie91 yipdw: Disqus is rapidly becoming the IE of comments systems
19:19 ๐Ÿ”— joepie91 *accidentally click IE shortcut on taskbar*
19:19 ๐Ÿ”— joepie91 OH GOD NO
19:19 ๐Ÿ”— joepie91 *frantically tries to get out of IE starting*
19:19 ๐Ÿ”— joepie91 WHY DID I DO THAT
19:19 ๐Ÿ”— joepie91 etc.
19:19 ๐Ÿ”— yipdw yeah
19:19 ๐Ÿ”— yipdw luckily Ghostery usually blocks it
19:20 ๐Ÿ”— yipdw but in this case I had to get all curious
19:20 ๐Ÿ”— turnip Hooray for ghostery
19:32 ๐Ÿ”— Schbirid i somehow broke disqus on my system but i dont mind at all
20:33 ๐Ÿ”— ersi If a site uses Disqus, I won't comment on that site
20:33 ๐Ÿ”— ersi 'cause it's disqusting
20:34 ๐Ÿ”— ersi Haha, who made the picture @ http://chronicle.com/blogs/profhacker/why-not-spare-a-little-bandwidth-for-the-archive-team/55071
20:34 ๐Ÿ”— ersi it's awesome
20:43 ๐Ÿ”— DFJustin so what happened with jason's new york library smackdown or do we have to wait for the statute of limitations to run out first
20:45 ๐Ÿ”— midas what about government run archives?
20:45 ๐Ÿ”— midas should we trust that?
20:46 ๐Ÿ”— midas UK government, great example
20:46 ๐Ÿ”— midas http://www.nationalarchives.gov.uk/webarchive/
20:46 ๐Ÿ”— midas or thailand, not really a archive but it's near a faultline and could be flooded
20:47 ๐Ÿ”— DFJustin with government the main concern is deliberate destruction and there is plenty of precedent on that
20:57 ๐Ÿ”— joepie91 man
20:57 ๐Ÿ”— joepie91 lowendtalk seems to be suffering from a bad case of the edits right now
20:57 ๐Ÿ”— joepie91 topic titles being edited by mods left, right and center
20:57 ๐Ÿ”— joepie91 to make them more politically correct
20:57 ๐Ÿ”— joepie91 (changing it into stuff like "misunderstanding blah blah - got refunded")
20:59 ๐Ÿ”— ersi guess it's hurting their relations with the crappy VPS providers
20:59 ๐Ÿ”— joepie91 even one where "allegations of" was prefixed
21:03 ๐Ÿ”— ersi they should add "political correct version:" as a prefix ;D
21:26 ๐Ÿ”— xmc ersi: I think someone here made it a long time ago
21:27 ๐Ÿ”— Smiley yup looks like it built ok :)
21:40 ๐Ÿ”— yipdw ersi: chfoo
21:41 ๐Ÿ”— yipdw at least according to archiveteam.org's change tracking
21:41 ๐Ÿ”— yipdw it's possible someone else did it
21:47 ๐Ÿ”— DFJustin oh https://archive.org/details/DigiBarn has started adding materials again
21:57 ๐Ÿ”— chfoo ersi: i made it. source file: https://github.com/chfoo/cloaked-octo-nemesis/blob/master/dev-docs/archiveteam_warrior_infrastructure.svg
22:00 ๐Ÿ”— ersi It's awesome.
22:01 ๐Ÿ”— SketchCow DFJustin: I gave them two days on account of snow
22:01 ๐Ÿ”— SketchCow Was also waiting to make sure Internet Archive didn't hit them first, I try to avoid muddying the pond
22:32 ๐Ÿ”— DFJustin aw

irclogger-viewer