#archiveteam 2011-12-09,Fri

Time Nickname Message
00:26 🔗 chronomex ^
01:01 🔗 VonGuard gamepro is empty
01:01 🔗 VonGuard we cleaned them out SketchCow
01:01 🔗 VonGuard everything is at the museum, being cataloged
01:02 🔗 underscor \o/
01:02 🔗 underscor Good news!
01:02 🔗 VonGuard yes indeed
01:05 🔗 RedType finally we'll be able to heat our cold building
01:05 🔗 RedType grab the matches
01:37 🔗 Coderjoe does that cleaning out include backissues?
01:45 🔗 underscor She's gone
06:56 🔗 dnova urgh
06:56 🔗 dnova one of my HDs just made some funky noises
07:09 🔗 dnova oh well.
07:11 🔗 bsmith093 when SketchCow comes back, i have the poenews data, ill be afk for about 12hrs or so
07:12 🔗 dnova how big is it?
07:12 🔗 dnova is it complete?
07:14 🔗 bsmith093 56,221 items, totalling 509.3 MB and, probably, yes
07:15 🔗 dnova tar.gz that
07:15 🔗 bsmith093 folder warc and cdx tar.gz coming right up
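A minimal sketch of the packaging step discussed above, assuming the WARC and CDX files sit together in one folder. The function name and the folder and archive names are hypothetical; the log does not show the command bsmith093 actually ran.

```python
import tarfile
from pathlib import Path


def pack_folder(folder, archive):
    """Write everything under `folder` into a gzip-compressed tar archive.

    Returns the stored member names so the result can be sanity-checked.
    """
    root = Path(folder)
    with tarfile.open(archive, "w:gz") as tar:
        # arcname keeps member paths relative to the folder's own name
        tar.add(root, arcname=root.name)
    # Re-open the finished archive and list what actually went in
    with tarfile.open(archive, "r:gz") as tar:
        return tar.getnames()
```

Called as `pack_folder("poenews", "poenews.tar.gz")`, this would produce a single file suitable for an rsync upload.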
07:16 🔗 dnova I mean, you can go to bed, man
07:16 🔗 dnova it can wait 12 hours :)
07:17 🔗 bsmith093 ok this could take a while, so im off to bed
07:17 🔗 dnova g'night
07:17 🔗 dnova wow this hard drive is definitely unhappy
07:18 🔗 dnova probably time to smash it
07:25 🔗 dnova what the
07:25 🔗 dnova another drive failure?
07:25 🔗 dnova really??
07:26 🔗 godane dnova: your hard drive died?
07:26 🔗 dnova it appears that 2 of them are dead or dying
07:27 🔗 godane that sucks
07:27 🔗 godane you need to get hard drives and there is a hard drive shortage
07:27 🔗 dnova I don't have an imminent need to replace these particular 2
07:28 🔗 godane i'm getting small stuff and burning it to dvd
07:28 🔗 godane i have (i think) the complete archive of crankgeeks
07:31 🔗 godane cgipod-135925-12-10-2008.mp4
07:31 🔗 godane cgipod-138108-01-01-2009.mp4
07:32 🔗 godane these are episode 146 and 148 of crankgeeks
07:32 🔗 godane i'm trying to figure out the diff so i can get episode 147 maybe
07:33 🔗 godane if it exists
07:33 🔗 dnova hmm
07:33 🔗 dnova the 2nd drive appears to be readable/fine but its partition table is fucked
07:33 🔗 godane problem is i have no date to go on
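A rough sketch of the filename comparison godane describes, using only the two names quoted above. It assumes the pattern is `cgipod-<id>-<MM>-<DD>-<YYYY>.mp4` with the date in month-day-year order; that ordering is a guess, and the helper names are hypothetical.

```python
import re
from datetime import date, timedelta

NAME_RE = re.compile(r"cgipod-(\d+)-(\d{2})-(\d{2})-(\d{4})\.mp4$")


def parse_name(name):
    """Split a filename into (numeric id, date), assuming MM-DD-YYYY order."""
    m = NAME_RE.search(name)
    if m is None:
        raise ValueError(f"unrecognised filename: {name}")
    ident, month, day, year = (int(g) for g in m.groups())
    return ident, date(year, month, day)


def gap(before, after):
    """Return the id range and the dates strictly between two known episodes."""
    id_a, day_a = parse_name(before)
    id_b, day_b = parse_name(after)
    days = [day_a + timedelta(days=n) for n in range(1, (day_b - day_a).days)]
    return (id_a + 1, id_b - 1), days
```

For episodes 146 and 148 this leaves every id from 135926 to 138107 and every date in between as a candidate, which is why a missing date makes episode 147 hard to guess by URL alone.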
07:34 🔗 dnova I need to run badblocks on this sumbidge
07:38 🔗 dnova 2 hard drive failures at once, haha
07:42 🔗 dnova just a pain in the ass, thankfully (no data lost or in danger)
07:49 🔗 yipdw just replace them with an equivalent capacity in SSDs
07:49 🔗 yipdw problem solved
12:25 🔗 dnova yipdw: haha
12:26 🔗 dnova if you're buying :)
14:09 🔗 dnova looks like at least one of the two drives is ok
14:09 🔗 dnova the ntfs mft was for some reason completely fubar
14:09 🔗 dnova running badblocks on it now.
14:21 🔗 SketchCow BEST MORNING EVER
14:21 🔗 dnova ehhh.
14:22 🔗 SketchCow No, see, here's what's happening.
14:22 🔗 SketchCow I'm adding thousands of issues of French computer magazines, OK?
14:22 🔗 dnova oui, d'accord
14:22 🔗 SketchCow So, a lot of them came from one abandonware site.
14:24 🔗 SketchCow Guess what.... they're FURIOUS
14:24 🔗 dnova er.. furious that you copied the stuff they copied?
14:24 🔗 SketchCow Yes, that exactly.
14:24 🔗 SketchCow That EXACTLY.
14:24 🔗 SketchCow Oh, it is glorious.
14:24 🔗 SketchCow So I'm watching as someone is adding a "review" to every single magazine issue.
14:25 🔗 SketchCow And the review is in angry, angry french.
14:25 🔗 dnova haha
14:25 🔗 dnova do they have ads on their site?
14:25 🔗 SketchCow And it says "This has come from abandonware-magazines.fr (or whatever) and you have stolen it. Please remove it."
14:25 🔗 dnova gotta get those ad impressions
14:28 🔗 SketchCow They're not putting in ads.
14:29 🔗 dnova then what is the problem
14:29 🔗 SketchCow Nothing.
14:30 🔗 dnova well, yeah, but what problem do THEY perceive?
14:30 🔗 SketchCow There's a guy who runs the site, named Fred.
14:30 🔗 SketchCow He has all the cute "just doing this for the good of humanity" thing.
14:30 🔗 SketchCow I am not convinced the person doing these reviews is Fred.
14:31 🔗 SketchCow Which means someone is trying to "help".
14:31 🔗 SketchCow It's no big deal, just sad someone's spending a lot of time doing copy-paste angry reviews.
14:31 🔗 SketchCow Most people would write a letter.
14:34 🔗 SketchCow I guess this person intends to write a thousand.
14:34 🔗 SketchCow Whatever the French for Asperger's is.
14:34 🔗 dnova how quickly can you delete them
14:34 🔗 dnova l'aspergers
14:36 🔗 SketchCow Found it. http://forum.abandonware.org/viewtopic.php?f=14&t=3610&p=20648&hilit=archive.org#p20648
14:39 🔗 SketchCow Just wrote Fred.
14:43 🔗 ersi There's no end to the amount of stupid people
14:44 🔗 SketchCow His concern is lack of credit.
14:44 🔗 SketchCow Then he found out I kept the textfile inside each and every .zip with full credit.
14:44 🔗 SketchCow His concern then went to lack of prominent credit.
14:45 🔗 SketchCow Obviously, I'll write a quick routine to go through all of the current and future issues and ensure a default description of "Digitization work by abandonware-magazines.org" or some such.
14:45 🔗 SketchCow I am sure that it is absolutely trivial to say "delete every review by this one account."
14:45 🔗 SketchCow So we're talking a few hours of silliness.
14:46 🔗 ersi should be an easy database query
14:46 🔗 ersi not that I ever want to muck about with a production system with raw SQL
14:46 🔗 ersi blurgh
14:48 🔗 SketchCow Oh, no, I am SURE there are moderation tools. This can't be the first time someone has spammed reviews.
14:49 🔗 SketchCow There's actually some nice tools internally.
14:49 🔗 SketchCow Can do a single operation to massive swaths of archive.org materials, for example.
14:49 🔗 SketchCow Like, find every uploaded issue of "Highlights" and re-derive formats.
14:49 🔗 ersi neat, that's pretty danged mature
14:49 🔗 SketchCow Well, they had to, so many million accounts?
14:50 🔗 ersi true
14:50 🔗 ersi I dunno, a lot of people think only one step. "Ok, someone spammed feature X. I'll make a mod tool to just delete it in this fashion, no other - until need be"
14:51 🔗 SketchCow Well, the manager software has all sorts of things that needed to be done, like adding catalog numbers or adding an additional metadata pair.
14:51 🔗 ersi neat none the less
14:51 🔗 SketchCow "All these issues were edited by Anna Wintour"
14:52 🔗 SketchCow I registered for the forum, waiting for my key.
14:52 🔗 SketchCow So I can pop in there and go "What's all this then harumph" and readjust my monocle.
14:53 🔗 SketchCow "One of your men seems to have gone off the plantation so to speak, haw haw. Perhaps you should reel him in haw haw. (Sucks on pipe)".
14:54 🔗 SketchCow Oh, look at this poor person go.
14:54 🔗 SketchCow I just have no way to communicate with them.
14:54 🔗 ersi start spamming his forum with "I've imported your stuff onto Internet Archive!! "
14:54 🔗 ersi "I've added a mirror!!!!"
14:54 🔗 ersi :)
14:56 🔗 emijrp www.youtube.com/watch?v=ZAigW0Sa1yo
14:57 🔗 emijrp I have found an IA collection dedicated to Spanish Revolution, finally. It is abandoned, I will try to revert that http://www.archive.org/details/centrodedocumentacionciudadana
15:10 🔗 SketchCow Anyway, best day ever
15:12 🔗 DFJustin SketchCow: I managed to track down a couple of japanese magazine pdfs, mind throwing them in computer-magazine-rack or wherever
15:12 🔗 DFJustin http://www.archive.org/details/mycom-basic-magazine-1988-04 http://www.archive.org/details/msx-fan-magazine-1992-12
15:14 🔗 DFJustin for some reason this stuff is way scarcer than english/french/etc
15:14 🔗 SketchCow I'd love that.
15:18 🔗 SketchCow Both are now in the rack.
15:18 🔗 SketchCow It's amazing how few there are, yes, but give us time, man!
15:18 🔗 SketchCow I mean, like I said, I'm hardly at the point of even scanning in original material yet.
15:18 🔗 SketchCow Once THAT happens... hoo boy
15:23 🔗 SketchCow Anyway, just posted to the guy's forum.
15:23 🔗 DFJustin btw here's some more spanish mags http://www.konamito.com/publicaciones-msx/
15:24 🔗 SketchCow It's 4pm in France, should get a response soon.
15:24 🔗 SketchCow Anyway, I will need to get back to REAL work, which includes shoving all the rest of the issues of french magazines up, by the way. :)
15:25 🔗 SketchCow Maybe I'll just finish that off today.
15:27 🔗 SketchCow Batcave is loaded with stuff that can go into archive.org.
15:28 🔗 SketchCow bsmith093: I'll take that copy of poe-news via an rsync if you'd like.
15:28 🔗 SketchCow Everything2 got their dupe of it off of archive.org, by the way
15:28 🔗 SketchCow Expected no less.
15:28 🔗 SketchCow It's stored and dark now, though, so no worries.
15:29 🔗 SketchCow Now, remember, even I know when to be diplomatic
15:30 🔗 SketchCow And not mention that the program I wrote to absorb all the magazines out of their site is named "surrender"
15:30 🔗 DFJustin LOL
15:42 🔗 ersi lol!
16:04 🔗 dnova ahaha
16:25 🔗 godane there is need for a url change on the geocities article
16:25 🔗 godane http://blog.greenpirate.org/bittorrent-as-preservation-of-culture/
16:26 🔗 godane the old illunatic.greenpiarte.org doesn't work
17:24 🔗 PepsiMax wat http://www.anyhub.net/
17:24 🔗 PepsiMax alard: http://www.anyhub.net/ WAT
18:00 🔗 Coderjoe one busy beaver
18:00 🔗 Coderjoe Results: 1 through 50 of 114 (0.001 secs)
18:00 🔗 Coderjoe You searched for: reviewer:"canelle"
19:03 🔗 bsmith093 SketchCow: im back and ready for the rsync
19:23 🔗 bsmith093 where do i send it
19:26 🔗 pberry bsmith093: which project?
19:27 🔗 bsmith093 the poenews dump for SketchCow
19:28 🔗 pberry I'm sure when he gets back he will have a slot on the batcave for you
19:28 🔗 SketchCow http://www.nbii.gov/portal/server.pt?open=512&objID=236&mode=2&cached=true
19:28 🔗 SketchCow NBII.GOV is being shut down for January 15.
19:28 🔗 SketchCow Top priority.
19:34 🔗 pberry yikes
19:35 🔗 pberry let me put this mobile me stuff on the back burner
19:35 🔗 pberry actually, let me get lunch first ;-)
19:36 🔗 bsmith093 SketchCow: can i get an rsync slot its ~250mb
19:38 🔗 bsmith093 grabbing nbii.gov
19:42 🔗 SketchCow WAIT
19:42 🔗 SketchCow Do NOT grab nbii.gov, bsmith093
19:42 🔗 bsmith093 right-o then, that's
19:42 🔗 bsmith093 fine
19:44 🔗 bsmith093 btw, how are you handling these random announcements, wget, or something smarter?
19:47 🔗 bsmith093 ok, wow, i just checked what nbii actually is, and , yeah you can take that, it looks like its probably gigantic
19:50 🔗 SketchCow It is.
19:50 🔗 SketchCow It needs coordination, likely warc and wget.
20:05 🔗 DFJustin http://www.gsc-game.com/ this company is reported to be shutting down so if you're itching for a less gigantic site grab it might be worth a go
20:08 🔗 yipdw you'd think the USGS, of all entities, would be receptive to the idea of sending a copy of one of their websites to an archive
20:09 🔗 DFJustin yeah definitely seems worth a go on the diplomatic side but I can easily see red tape holding that up so I would download the shit out of it in the meantime
20:18 🔗 Coderjoe yipdw: except that the USGS usually charges money for retrieving bulk data
20:18 🔗 SketchCow Here is how to do it:
20:18 🔗 SketchCow 1. Power cycle the game.
20:18 🔗 SketchCow 2. Allow the demo to run through 11 times.
20:18 🔗 SketchCow 3. On the 12th game demo, insert a quarter at the moment that Mr. Do! eats the center prize.
20:18 🔗 SketchCow 4. Start a game.
20:18 🔗 SketchCow 5. Loosen the first apple on the bottom right side.
20:18 🔗 SketchCow 6. If everything is done correctly, there will be a diamond when it drops, every time.
20:39 🔗 SketchCow EVERY TIME
20:42 🔗 Coderjoe oh man
20:43 🔗 Coderjoe I need to find a copy of the atari 520st speech synthesizer 2.0 and dig out my atari from storage (and pray it still works)
21:06 🔗 Coderjoe_ well that was awesome
21:07 🔗 Coderjoe_ mess touched my nvidia drivers in a way they didn't like, and they ran off to tell sonic
21:08 🔗 bsmith093 SketchCow: rsync for the poenews ready?
21:08 🔗 Coderjoe_ yes
21:08 🔗 Coderjoe_ you have an rsync slot
21:09 🔗 bsmith093 where?
21:10 🔗 Coderjoe_ check your private messages
21:10 🔗 bsmith093 ok i saw the pm, sending
21:22 🔗 SketchCow I love problems that solve themselves.
21:25 🔗 ersi I used to, but now that I'm a Software Tester - I don't
21:25 🔗 ersi It just means they'll come back later
21:44 🔗 Coderjoe mess even bluescreens the most recent drivers \o/
21:55 🔗 bsmith093 sent to batcave in the poenews directory, it's in the tar.gz there
21:57 🔗 bsmith093 yipdw: anything new for ffnet, or are you focusing on the govt archive thats closing down
22:02 🔗 yipdw bsmith093: right now I'm doing what I get paid to do
22:03 🔗 yipdw as far as the fanfiction.net crawler goes, though, I did rework it to be more tolerant of network failures, as well as respecting fanfiction.net's Last-Modified and Cache-Control headers
22:03 🔗 bsmith093 well good for you, the economy thanks you :)
22:09 🔗 yipdw I think fanfiction.net's Last-Modified headers are kinda fucked
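A minimal sketch of what respecting Last-Modified and Cache-Control and tolerating network failures can look like in a crawler. This is not yipdw's code; the function names, retry count, and backoff are assumptions made for illustration, and only the Python standard library is used.

```python
import time
import urllib.error
import urllib.request


def conditional_headers(last_modified=None):
    """Build request headers that let the server answer 304 Not Modified."""
    headers = {}
    if last_modified:
        headers["If-Modified-Since"] = last_modified
    return headers


def max_age(cache_control):
    """Return max-age in seconds from a Cache-Control header, or None."""
    for part in (cache_control or "").split(","):
        name, _, value = part.strip().partition("=")
        if name.lower() == "max-age" and value.strip().isdigit():
            return int(value)
    return None


def fetch(url, last_modified=None, retries=3, delay=5.0):
    """Fetch url, returning (status, body, headers); retries on URLError."""
    request = urllib.request.Request(url, headers=conditional_headers(last_modified))
    for attempt in range(retries):
        try:
            with urllib.request.urlopen(request, timeout=30) as response:
                return response.status, response.read(), dict(response.headers)
        except urllib.error.HTTPError as err:
            if err.code == 304:  # unchanged since last_modified, nothing to re-download
                return 304, b"", dict(err.headers)
            raise
        except urllib.error.URLError:
            if attempt == retries - 1:
                raise
            time.sleep(delay * (attempt + 1))  # simple linear backoff
```

A crawler built this way would store each page's Last-Modified value, skip the re-download on a 304, and avoid re-requesting a page sooner than its `max_age` allows. If a site's Last-Modified values are unreliable, the 304 path cannot be trusted on its own.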
23:33 🔗 yipdw bsmith093: FYI, the code in that gist for the ff story discovery thing is up to date
23:33 🔗 yipdw in order to run it, you'll need a Redis instance
23:34 🔗 yipdw I recommend removing the Linux Firefox user_agent_alias bit
23:34 🔗 yipdw because (1) it's not necessary and (2) it's dishonest
23:35 🔗 yipdw I'm running the discovery thing right now
23:35 🔗 yipdw although at a pretty slow pace
23:35 🔗 yipdw I'll let you know what it comes up with; alternatively, just replicate my Redis server's state or something