| Time |
Nickname |
Message |
|
00:02
🔗
|
dashcloud |
ia interface is installing now- what command should I use once it's ready? |
|
00:04
🔗
|
SketchCow |
It'sthe current ia one but it shouldn't be 0.6.6. |
|
00:04
🔗
|
SketchCow |
It's like 0.7.2 or something. |
|
00:04
🔗
|
SketchCow |
Or more. |
|
00:04
🔗
|
SketchCow |
It's the most recent. |
|
00:08
🔗
|
dashcloud |
okay- I'm upgrading it to 0.7.1 now |
|
06:37
🔗
|
Scuttle |
hm, is there a channel for the swipnet archiving? |
|
06:40
🔗
|
Rotab |
#swiped |
|
06:40
🔗
|
xmc |
Scuttle: #swiped |
|
06:41
🔗
|
Rotab |
lol |
|
06:42
🔗
|
xmc |
exactly |
|
06:47
🔗
|
Scuttle |
was thinking I'd set my GBit connection to work... |
|
06:50
🔗
|
Scuttle |
hm, the meter in the bottom left corner, is that an indication of how much I have up/downloaded? |
|
06:55
🔗
|
vantec |
For the warrior, yes. |
|
15:27
🔗
|
Entrance |
Excellent news mates! The wayback machine has working backups of youtube videos now! Anybody got any ideas for a way to just scour youtube and route videos into the waybackmachine? |
|
15:27
🔗
|
Entrance |
https://web.archive.org/web/20110804113440/http://www.youtube.com/watch?v=npHWX1dciOE&gl=US&hl=en&has_verified=1 Example number 1 here |
|
15:28
🔗
|
Entrance |
I was thinking simply converting the save url into a ip and putting it as a proxy in a spider might work, just set the spider to strictly crawl and not save |
|
15:28
🔗
|
DFJustin |
yeah that's existed off and on for a while, afaik there's no way to make them get a specific video |
|
15:28
🔗
|
DFJustin |
... |
|
15:29
🔗
|
xmc |
goddamn webchat |
|
15:29
🔗
|
DFJustin |
was gonna say, supposedly it grabs every video that gets tweeted but I haven't noticed that to be the case in practice |
|
15:29
🔗
|
xmc |
I feel like webchat makes more trouble than it's worth |
|
15:29
🔗
|
xmc |
ah, only the ones in the 1% "spritzer" twitter feed |
|
15:29
🔗
|
DFJustin |
that would make sense |
|
15:30
🔗
|
DFJustin |
but that's not what sketchcow's been telling everyone |
|
15:30
🔗
|
xmc |
hm |
|
15:30
🔗
|
xmc |
ok |
|
15:32
🔗
|
DFJustin |
for whatever reason installing an irc client is a huge barrier for some people, I had to walk someone through using webchat before |
|
15:32
🔗
|
DFJustin |
it does seem to be the case that they're not good for much once they finally connect though |
|
15:35
🔗
|
Jonimus |
would it be possible to have the Tracker link to the project wiki page along with the website that is being saved and the leaderboard? |
|
15:36
🔗
|
Jonimus |
or the warrior status page displayed by runpipeline? |
|
18:01
🔗
|
SketchCow |
HEY WHAT |
|
18:02
🔗
|
juver |
hey folks |
|
18:02
🔗
|
SketchCow |
Hi, juver. |
|
18:02
🔗
|
SketchCow |
DFJustin: I found out the policy changed. |
|
19:00
🔗
|
Emcy |
do you have a twitter |
|
19:01
🔗
|
Smiley |
Emcy: who exactly? |
|
19:01
🔗
|
Smiley |
there is @archiveteam and @sketchcow respectively |
|
19:02
🔗
|
sep332 |
lol there is no sketchcow |
|
19:02
🔗
|
Emcy |
@archiveteam is the one that announces new projects |
|
19:02
🔗
|
Emcy |
probably/ |
|
19:02
🔗
|
Emcy |
? |
|
19:03
🔗
|
Emcy |
i tend to forget i have warrior installed until i read about another site shutting down, then i fire it up |
|
19:03
🔗
|
Emcy |
i bet most people with warrior do that |
|
19:06
🔗
|
Smiley |
@archiveteam-warrior i think |
|
19:07
🔗
|
Smiley |
Emcy: that's fine |
|
19:07
🔗
|
Smiley |
to be honest most projects end up with too many people, which is awesome |
|
19:16
🔗
|
SketchCow |
SPOON |
|
19:16
🔗
|
SketchCow |
Me and the spoon were hanging out. |
|
19:16
🔗
|
* |
SketchCow baller |
|
19:17
🔗
|
Nemo_bis |
WikiTeam doesn't! We always have space for more |
|
19:18
🔗
|
Emcy |
eh i was already following archiveteam |
|
19:18
🔗
|
Emcy |
just dont tweet a lo |
|
21:59
🔗
|
deathy |
is there any best-practice for archiving email? as in maildir/mbox/others.. |
|
22:08
🔗
|
Emcy |
tcan i shut this down now |
|
22:08
🔗
|
Emcy |
the tracker says 0 to do + 1400 "out" |
|
22:18
🔗
|
Smiley |
Emcy: yeah |
|
22:18
🔗
|
Smiley |
if you wish :) |
|
22:20
🔗
|
Emcy |
ok |
|
22:25
🔗
|
dashcloud |
SketchCow: finally got the current IA python setup- how do I grab all the cdbbsarchive images? |
|
22:26
🔗
|
SketchCow |
ia search collection:cdbbsarchive |
|
22:26
🔗
|
SketchCow |
That returns a list of all items in that collection. |
|
22:26
🔗
|
SketchCow |
Do, like: ia search collection:cdbbsarchive | sort -u > hitlist.txt |
|
22:26
🔗
|
SketchCow |
So now you have hitlist.txt, which is a nice alphabetic list. |
|
22:28
🔗
|
SketchCow |
for each in `cat hitlist.txt` |
|
22:28
🔗
|
SketchCow |
do |
|
22:28
🔗
|
SketchCow |
ia download $each |
|
22:28
🔗
|
SketchCow |
done |
|
22:29
🔗
|
xmc |
deathy: tar of maildir is nice. |
|
22:29
🔗
|
xmc |
mbox has issues |
|
22:29
🔗
|
dashcloud |
is there a way to tell it to only grab jpgs or pngs? (that's all I really want from the collection) |
|
22:32
🔗
|
Smiley |
add | grep jpg or png on the end? |
|
22:32
🔗
|
Smiley |
well, before the sort |
|
22:32
🔗
|
Smiley |
| grep jpg | sort -u > blah |
|
22:44
🔗
|
SketchCow |
Smiley: Wrong |
|
22:45
🔗
|
SketchCow |
More like: |
|
22:45
🔗
|
SketchCow |
ia list $list | grep -i \.[JjGg][PpIi][FfGg] |
|
23:14
🔗
|
dashcloud |
thanks SketchCow ! the list of pictures is downloading now (I hope), and I'll grab the actual pictures later |