Time |
Nickname |
Message |
00:02
🔗
|
dashcloud |
ia interface is installing now- what command should I use once it's ready? |
00:04
🔗
|
SketchCow |
It's the current ia one but it shouldn't be 0.6.6. |
00:04
🔗
|
SketchCow |
It's like 0.7.2 or something. |
00:04
🔗
|
SketchCow |
Or more. |
00:04
🔗
|
SketchCow |
It's the most recent. |
00:08
🔗
|
dashcloud |
okay- I'm upgrading it to 0.7.1 now |
06:37
🔗
|
Scuttle |
hm, is there a channel for the swipnet archiving? |
06:40
🔗
|
Rotab |
#swiped |
06:40
🔗
|
xmc |
Scuttle: #swiped |
06:41
🔗
|
Rotab |
lol |
06:42
🔗
|
xmc |
exactly |
06:47
🔗
|
Scuttle |
was thinking I'd set my GBit connection to work... |
06:50
🔗
|
Scuttle |
hm, the meter in the bottom left corner, is that an indication of how much I have up/downloaded? |
06:55
🔗
|
vantec |
For the warrior, yes. |
15:27
🔗
|
Entrance |
Excellent news mates! The wayback machine has working backups of youtube videos now! Anybody got any ideas for a way to just scour youtube and route videos into the waybackmachine? |
15:27
🔗
|
Entrance |
https://web.archive.org/web/20110804113440/http://www.youtube.com/watch?v=npHWX1dciOE&gl=US&hl=en&has_verified=1 Example number 1 here |
15:28
🔗
|
Entrance |
I was thinking simply converting the save url into a ip and putting it as a proxy in a spider might work, just set the spider to strictly crawl and not save |
15:28
🔗
|
DFJustin |
yeah that's existed off and on for a while, afaik there's no way to make them get a specific video |
15:28
🔗
|
DFJustin |
... |
15:29
🔗
|
xmc |
goddamn webchat |
15:29
🔗
|
DFJustin |
was gonna say, supposedly it grabs every video that gets tweeted but I haven't noticed that to be the case in practice |
15:29
🔗
|
xmc |
I feel like webchat makes more trouble than it's worth |
15:29
🔗
|
xmc |
ah, only the ones in the 1% "spritzer" twitter feed |
15:29
🔗
|
DFJustin |
that would make sense |
15:30
🔗
|
DFJustin |
but that's not what sketchcow's been telling everyone |
15:30
🔗
|
xmc |
hm |
15:30
🔗
|
xmc |
ok |
15:32
🔗
|
DFJustin |
for whatever reason installing an irc client is a huge barrier for some people, I had to walk someone through using webchat before |
15:32
🔗
|
DFJustin |
it does seem to be the case that they're not good for much once they finally connect though |
15:35
🔗
|
Jonimus |
would it be possible to have the Tracker link to the project wiki page along with the website that is being saved and the leaderboard? |
15:36
🔗
|
Jonimus |
or the warrior status page displayed by runpipeline? |
18:01
🔗
|
SketchCow |
HEY WHAT |
18:02
🔗
|
juver |
hey folks |
18:02
🔗
|
SketchCow |
Hi, juver. |
18:02
🔗
|
SketchCow |
DFJustin: I found out the policy changed. |
19:00
🔗
|
Emcy |
do you have a twitter |
19:01
🔗
|
Smiley |
Emcy: who exactly? |
19:01
🔗
|
Smiley |
there is @archiveteam and @sketchcow respectively |
19:02
🔗
|
sep332 |
lol there is no sketchcow |
19:02
🔗
|
Emcy |
@archiveteam is the one that announces new projects |
19:02
🔗
|
Emcy |
probably/ |
19:02
🔗
|
Emcy |
? |
19:03
🔗
|
Emcy |
i tend to forget i have warrior installed until i read about another site shutting down, then i fire it up |
19:03
🔗
|
Emcy |
i bet most people with warrior do that |
19:06
🔗
|
Smiley |
@archiveteam-warrior i think |
19:07
🔗
|
Smiley |
Emcy: that's fine |
19:07
🔗
|
Smiley |
to be honest most projects end up with too many people, which is awesome |
19:16
🔗
|
SketchCow |
SPOON |
19:16
🔗
|
SketchCow |
Me and the spoon were hanging out. |
19:16
🔗
|
* |
SketchCow baller |
19:17
🔗
|
Nemo_bis |
WikiTeam doesn't! We always have space for more |
19:18
🔗
|
Emcy |
eh i was already following archiveteam |
19:18
🔗
|
Emcy |
just dont tweet a lot |
21:59
🔗
|
deathy |
is there any best-practice for archiving email? as in maildir/mbox/others.. |
22:08
🔗
|
Emcy |
can i shut this down now |
22:08
🔗
|
Emcy |
the tracker says 0 to do + 1400 "out" |
22:18
🔗
|
Smiley |
Emcy: yeah |
22:18
🔗
|
Smiley |
if you wish :) |
22:20
🔗
|
Emcy |
ok |
22:25
🔗
|
dashcloud |
SketchCow: finally got the current IA python setup- how do I grab all the cdbbsarchive images? |
22:26
🔗
|
SketchCow |
ia search collection:cdbbsarchive |
22:26
🔗
|
SketchCow |
That returns a list of all items in that collection. |
22:26
🔗
|
SketchCow |
Do, like: ia search collection:cdbbsarchive | sort -u > hitlist.txt |
22:26
🔗
|
SketchCow |
So now you have hitlist.txt, which is a nice alphabetic list. |
22:28
🔗
|
SketchCow |
for each in `cat hitlist.txt` |
22:28
🔗
|
SketchCow |
do |
22:28
🔗
|
SketchCow |
ia download $each |
22:28
🔗
|
SketchCow |
done |
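The search and the loop typed out above can be collected into one small script. This is a minimal sketch, assuming the `ia` command-line tool is installed and configured. The function names `build_hitlist` and `download_hitlist` are made up for illustration, and the `--itemlist` flag (which asks `ia search` to print bare identifiers) is an assumption about the installed version; drop it if your `ia` already prints one identifier per line, as the log implies.

```shell
# Hypothetical helpers; only the `ia search` and `ia download` calls come from the log.

# Write a sorted, de-duplicated list of item identifiers for a collection.
# ASSUMPTION: --itemlist makes `ia search` print one bare identifier per line.
build_hitlist() {
    collection="$1"
    hitlist="$2"
    ia search "collection:$collection" --itemlist | sort -u > "$hitlist"
}

# Download each item in the hit list, one identifier per line.
# Reading with `while read` avoids word-splitting surprises from `cat`,
# and a failed item is reported without stopping the rest.
download_hitlist() {
    hitlist="$1"
    while IFS= read -r each; do
        [ -n "$each" ] || continue          # skip blank lines
        ia download "$each" < /dev/null || echo "failed: $each" >&2
    done < "$hitlist"
}
```

Usage would look like `build_hitlist cdbbsarchive hitlist.txt` followed by `download_hitlist hitlist.txt`.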
22:29
🔗
|
xmc |
deathy: tar of maildir is nice. |
22:29
🔗
|
xmc |
mbox has issues |
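A tar of a Maildir, as xmc suggests, could look like the sketch below. The function name `archive_maildir`, the argument order and the example paths are assumptions for illustration. Maildir stores one message per file, so tar keeps every message byte-for-byte; one commonly cited mbox problem is that body lines starting with "From " have to be escaped, though the log does not say which issues xmc had in mind.

```shell
# Pack a Maildir into a compressed tarball.
# ASSUMPTIONS: the function name, argument order and archive name are illustrative.
archive_maildir() {
    maildir="$1"      # e.g. "$HOME/Maildir"
    archive="$2"      # e.g. "$HOME/mail-backup.tar.gz"
    # -C switches to the parent directory so the archive holds relative paths.
    tar -czf "$archive" -C "$(dirname "$maildir")" "$(basename "$maildir")"
}
```

Listing the result with `tar -tzf` before deleting anything is a cheap way to confirm the messages made it in.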
22:29
🔗
|
dashcloud |
is there a way to tell it to only grab jpgs or pngs? (that's all I really want from the collection) |
22:32
🔗
|
Smiley |
add | grep jpg or png on the end? |
22:32
🔗
|
Smiley |
well, before the sort |
22:32
🔗
|
Smiley |
| grep jpg | sort -u > blah |
22:44
🔗
|
SketchCow |
Smiley: Wrong |
22:45
🔗
|
SketchCow |
More like: |
22:45
🔗
|
SketchCow |
ia list $list | grep -i \.[JjGg][PpIi][FfGg] |
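Since dashcloud asked for jpgs or pngs, a tighter filter may help. Typed unquoted as above, the shell strips the backslash before grep sees the pattern, and the character classes match suffixes like jpg and gif but not png. The sketch below quotes the pattern and anchors it to the end of the name. `filter_images` and `list_images` are hypothetical helper names, and the sketch assumes `ia list` prints one filename per line.

```shell
# Keep only names ending in .jpg, .jpeg or .png (any letter case).
# Quoting the pattern stops the shell from eating the backslash.
filter_images() {
    grep -iE '\.(jpe?g|png)$'
}

# Print "identifier/filename" for every image in every item of a hit list.
# `list_images` is a hypothetical helper; `ia list` is the call from the log.
list_images() {
    hitlist="$1"
    while IFS= read -r each; do
        [ -n "$each" ] || continue          # skip blank lines
        ia list "$each" < /dev/null | filter_images |
            while IFS= read -r name; do
                printf '%s/%s\n' "$each" "$name"
            done
    done < "$hitlist"
}
```

Something like `list_images hitlist.txt > pictures.txt` would then give the list of pictures to fetch later.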
23:14
🔗
|
dashcloud |
thanks SketchCow ! the list of pictures is downloading now (I hope), and I'll grab the actual pictures later |