#archiveteam 2014-08-26,Tue

Time Nickname Message
00:02 🔗 dashcloud ia interface is installing now- what command should I use once it's ready?
00:04 🔗 SketchCow It's the current ia one but it shouldn't be 0.6.6.
00:04 🔗 SketchCow It's like 0.7.2 or something.
00:04 🔗 SketchCow Or more.
00:04 🔗 SketchCow It's the most recent.
00:08 🔗 dashcloud okay- I'm upgrading it to 0.7.1 now
06:37 🔗 Scuttle hm, is there a channel for the swipnet archiving?
06:40 🔗 Rotab #swiped
06:40 🔗 xmc Scuttle: #swiped
06:41 🔗 Rotab lol
06:42 🔗 xmc exactly
06:47 🔗 Scuttle was thinking I'd set my GBit connection to work...
06:50 🔗 Scuttle hm, the meter in the bottom left corner, is that an indication of how much I have up/downloaded?
06:55 🔗 vantec For the warrior, yes.
15:27 🔗 Entrance Excellent news mates! The wayback machine has working backups of youtube videos now! Anybody got any ideas for a way to just scour youtube and route videos into the waybackmachine?
15:27 🔗 Entrance https://web.archive.org/web/20110804113440/http://www.youtube.com/watch?v=npHWX1dciOE&gl=US&hl=en&has_verified=1 Example number 1 here
15:28 🔗 Entrance I was thinking simply converting the save url into an ip and putting it as a proxy in a spider might work, just set the spider to strictly crawl and not save
15:28 🔗 DFJustin yeah that's existed off and on for a while, afaik there's no way to make them get a specific video
15:28 🔗 DFJustin ...
15:29 🔗 xmc goddamn webchat
15:29 🔗 DFJustin was gonna say, supposedly it grabs every video that gets tweeted but I haven't noticed that to be the case in practice
15:29 🔗 xmc I feel like webchat makes more trouble than it's worth
15:29 🔗 xmc ah, only the ones in the 1% "spritzer" twitter feed
15:29 🔗 DFJustin that would make sense
15:30 🔗 DFJustin but that's not what sketchcow's been telling everyone
15:30 🔗 xmc hm
15:30 🔗 xmc ok
15:32 🔗 DFJustin for whatever reason installing an irc client is a huge barrier for some people, I had to walk someone through using webchat before
15:32 🔗 DFJustin it does seem to be the case that they're not good for much once they finally connect though
15:35 🔗 Jonimus would it be possible to have the Tracker link to the project wiki page along with the website that is being saved and the leaderboard?
15:36 🔗 Jonimus or the warrior status page displayed by runpipeline?
18:01 🔗 SketchCow HEY WHAT
18:02 🔗 juver hey folks
18:02 🔗 SketchCow Hi, juver.
18:02 🔗 SketchCow DFJustin: I found out the policy changed.
19:00 🔗 Emcy do you have a twitter
19:01 🔗 Smiley Emcy: who exactly?
19:01 🔗 Smiley there is @archiveteam and @sketchcow respectively
19:02 🔗 sep332 lol there is no sketchcow
19:02 🔗 Emcy @archiveteam is the one that announces new projects
19:02 🔗 Emcy probably/
19:02 🔗 Emcy ?
19:03 🔗 Emcy i tend to forget i have warrior installed until i read about another site shutting down, then i fire it up
19:03 🔗 Emcy i bet most people with warrior do that
19:06 🔗 Smiley @archiveteam-warrior i think
19:07 🔗 Smiley Emcy: that's fine
19:07 🔗 Smiley to be honest most projects end up with too many people, which is awesome
19:16 🔗 SketchCow SPOON
19:16 🔗 SketchCow Me and the spoon were hanging out.
19:16 🔗 * SketchCow baller
19:17 🔗 Nemo_bis WikiTeam doesn't! We always have space for more
19:18 🔗 Emcy eh i was already following archiveteam
19:18 🔗 Emcy just dont tweet a lot
21:59 🔗 deathy is there any best-practice for archiving email? as in maildir/mbox/others..
22:08 🔗 Emcy can i shut this down now
22:08 🔗 Emcy the tracker says 0 to do + 1400 "out"
22:18 🔗 Smiley Emcy: yeah
22:18 🔗 Smiley if you wish :)
22:20 🔗 Emcy ok
22:25 🔗 dashcloud SketchCow: finally got the current IA python setup- how do I grab all the cdbbsarchive images?
22:26 🔗 SketchCow ia search collection:cdbbsarchive
22:26 🔗 SketchCow That returns a list of all items in that collection.
22:26 🔗 SketchCow Do, like: ia search collection:cdbbsarchive | sort -u > hitlist.txt
22:26 🔗 SketchCow So now you have hitlist.txt, which is a nice alphabetic list.
22:28 🔗 SketchCow for each in `cat hitlist.txt`
22:28 🔗 SketchCow do
22:28 🔗 SketchCow ia download $each
22:28 🔗 SketchCow done
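
Note: the download loop above can be sketched as a reusable function. This is only a minimal sketch, assuming the `ia` command-line tool from the internetarchive Python package is on PATH and that hitlist.txt holds one item identifier per line. The function name `download_hitlist` is hypothetical and does not come from the chat.

```shell
#!/usr/bin/env bash
# Sketch: download every item named in a hitlist file (one identifier per line).
# Assumption: `ia` is the Internet Archive command-line tool and is installed.

download_hitlist() {
    local hitlist="$1"
    local item
    # Read line by line so identifiers are never word-split or glob-expanded.
    while IFS= read -r item; do
        # Skip blank lines.
        [ -z "$item" ] && continue
        # Redirect stdin so the command cannot consume the rest of the hitlist.
        ia download "$item" < /dev/null || echo "failed: $item" >&2
    done < "$hitlist"
}
```

Calling `download_hitlist hitlist.txt` would then fetch each listed item in turn and report any identifier whose download fails, instead of stopping silently.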
22:29 🔗 xmc deathy: tar of maildir is nice.
22:29 🔗 xmc mbox has issues
22:29 🔗 dashcloud is there a way to tell it to only grab jpgs or pngs? (that's all I really want from the collection)
22:32 🔗 Smiley add | grep jpg or png on the end?
22:32 🔗 Smiley well, before the sort
22:32 🔗 Smiley | grep jpg | sort -u > blah
22:44 🔗 SketchCow Smiley: Wrong
22:45 🔗 SketchCow More like:
22:45 🔗 SketchCow ia list $list | grep -i '\.[JjGg][PpIi][FfGg]'
23:14 🔗 dashcloud thanks SketchCow ! the list of pictures is downloading now (I hope), and I'll grab the actual pictures later
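
Note: dashcloud asked for only jpgs or pngs. A minimal sketch of a filename filter for that case follows. The function name `filter_images` and the exact pattern are illustrative choices, not what was posted in the channel. It expects one filename per line on standard input, such as the output of `ia list` for a single item.

```shell
#!/usr/bin/env bash
# Sketch: keep only filenames ending in .jpg, .jpeg or .png (case-insensitive).
# The pattern is quoted so the shell passes it to grep unchanged, and it is
# anchored with $ so names like photo.jpg.xml are not matched.

filter_images() {
    grep -iE '\.(jpe?g|png)$'
}
```

For example, `ia list "$item" | filter_images >> pictures.txt` would append the matching filenames of one item to a list of pictures.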
