Time |
Nickname |
Message |
01:42
π
|
SketchCow |
Oh man, we are NOT in good shape to take over matchmaking. |
01:42
π
|
SketchCow |
I really want short urls to be fixed soon. |
01:42
π
|
SketchCow |
19:44 < fellowshi> Everyone shuts down Gameservers all the Time. |
01:42
π
|
SketchCow |
This is a misrepresentation |
01:43
π
|
SketchCow |
Maybe 5 get shut off a year |
02:18
π
|
anarllama |
hi there |
02:18
π
|
anarllama |
I have an old Eee PC |
02:18
π
|
anarllama |
IΓ’ΒΒve installed fedora on it, but I was wondering what the best distro would be |
02:18
π
|
anarllama |
to archive stuff |
02:18
π
|
anarllama |
(including running the Warrior) |
02:31
π
|
SketchCow |
Any is fine, we just use a virtual box instance on top of your fun. |
02:31
π
|
SketchCow |
If you want to get down and dirty and run the script we run, then it should still be fine. |
02:40
π
|
anarllama |
you canΓ’ΒΒt run the warrior as an OS ? |
02:51
π
|
dashcloud |
I thought I was pretty clear on the point that doing the matchmaking would be HARD- should I have been more blunt about it? |
03:06
π
|
SketchCow |
Argue matchmaking in -bs |
03:06
π
|
SketchCow |
But yes, matchmaking as an enterprise should be done by an organization othr than archive team |
03:06
π
|
SketchCow |
We're not in a position, nor should we be, to provide vital internet services |
03:07
π
|
SketchCow |
We're good at gathering the data that someone else running vital internet services might need |
03:07
π
|
SketchCow |
i.e. wayback or upcoming.org |
03:32
π
|
yipdw |
anarllama: you can; there's a Docker image for that |
03:32
π
|
yipdw |
https://github.com/ArchiveTeam/warrior-dockerfile |
04:23
π
|
SketchCow |
I'm starting to think I'm one of the only people who uses screen + irc |
09:09
π
|
danneh_ |
Just wondering, if I'm backing up several government websites (agencies that are gonna be axed with Australia's new budget), should I upload each warc'd site as a separate item in IA or just upload all the warcs as a single IA item? |
09:12
π
|
godane |
upload them as separate item |
09:13
π
|
danneh_ |
awesome, will do. thanks! |
09:49
π
|
trs80 |
danneh: oog, good call |
09:49
π
|
trs80 |
danneh: let me know if you need some help |
14:33
π
|
ArhiveBot |
!archive http://www.nebraskaweatherphotos.org/ |
14:34
π
|
SketchCow |
What..... is that. |
14:35
π
|
Smiley |
wrong chan? |
14:39
π
|
yipdw |
ok, that's a bot |
15:35
π
|
Wabadub |
ok, i downloaded the 21gb twop archive and it makes warcqtviewer go unresponsive. can it handle such big warc files? |
16:08
π
|
Smiley |
and all those people saying "I have CD backups are now feeling sheepish... -> |
16:08
π
|
Smiley |
http://www.theatlantic.com/technology/archive/2014/05/the-library-of-congress-wants-to-destroy-your-old-cds-for-science/370804/ |
16:55
π
|
SadDM |
Long lost commentary tracks from recalled laserdisc releases of James Bond movies... check: https://archive.org/details/from_russia_with_love-criterion_laserdisc-commentary_track |
16:57
π
|
SketchCow |
https://twitter.com/textfiles/status/466618491207176193 |
16:57
π
|
SketchCow |
Well, that went better than expected |
17:10
π
|
midas |
my provider is stopping it's homepage service, http://home.xmsnet.nl/username what would be the best way to provide? |
17:10
π
|
midas |
google says about 9000 pages |
17:11
π
|
SadDM |
SketchCow: He's always been kind of a dick... on behalf of the rest of Canada I'd like to aplologize. |
17:12
π
|
yipdw |
I don't think Jian Ghomeshi is a dick, he's just stupid |
17:13
π
|
midas |
lets archive that fragment forever |
17:14
π
|
SadDM |
get on it! |
17:14
π
|
SadDM |
:-D |
17:14
π
|
midas |
and after that, im going to put a sticky bit on it and try to delete it a couple of thousand times |
17:15
π
|
exmic |
ha |
17:20
π
|
yipdw |
midas: done |
17:20
π
|
SketchCow |
Punch a DJ in the face, doo dah, doo dah |
17:22
π
|
midas |
http://thumbnails.cbc.ca/maven_legacy/thumbnails/15/449/qpodcast_20140514_14626_uploaded.mp3 |
17:23
π
|
midas |
for how also wants to grab the bastard and delete it a couple of times, just to be sure |
17:23
π
|
midas |
who* |
17:23
π
|
yipdw |
midas: archivebot got it |
17:23
π
|
midas |
perfect |
17:23
π
|
yipdw |
SketchCow: you may also want to retweet the thumbnails.cbc.ca link; the podcasts.* link actually doesn't exist |
17:25
π
|
midas |
anyway, about my provider |
17:25
π
|
SketchCow |
Nah, it's just me having fun. |
17:25
π
|
midas |
they are stopping the homepage service |
17:25
π
|
midas |
any idea's how to grab these ~9000 sites? |
17:25
π
|
yipdw |
wget |
17:26
π
|
DFJustin |
make a list of user names, ???, warrior project, profit? |
17:26
π
|
yipdw |
alternatively if you have a bunch of URLs and they're all self-contained you can shove them all into archivebot |
17:26
π
|
midas |
yipdw: you know what really makes me sad? this: http://home.xmsnet.nl/berendbotje/ |
17:26
π
|
yipdw |
ha |
17:27
π
|
midas |
this guy put up a link to his archive, a frigging archive, and then dyndns fucking killed the free service. |
17:27
π
|
midas |
that's just cruel |
17:29
π
|
schbirid |
does someone know an existing script/tool to keep archives of online source code repositories? i would have a list of repos and want them to update daily |
17:29
π
|
schbirid |
easy scripting task but hey, no need to replicate if it has already been written |
17:31
π
|
DFJustin |
http://urlm.co/www.atcarchive.dyndns.org has an ip address but it doesn't respond to http |
17:32
π
|
DFJustin |
someone wanna portscan the block lol |
17:32
π
|
midas |
lol :p |
17:32
π
|
midas |
xms is a provider with alot of FTTH connections, so yeah, probably alot of servers available |
17:33
π
|
DFJustin |
or you could tweet him https://twitter.com/ATCArchive |
18:15
π
|
schbirid |
wrote it myself, quick and ugly and buggy https://github.com/SpiritQuaddicted/quake-code-archives |
18:32
π
|
ATZ0 |
East Village Radio signing off - http://evgrieve.com/2014/05/exclusive-east-village-radio-is-signing.html - Website and Show archives here - http://www.eastvillageradio.com/ |
18:40
π
|
midas |
right, so 8000 urls are actually about 600 websites |
18:40
π
|
midas |
according to my sorting and such |
18:41
π
|
ATZ0 |
"All of our archives will be available, eventually ...." |
18:58
π
|
SadDM |
ATZ0: that's a whole lot of data |
18:59
π
|
ATZ0 |
Hasn't stopped us before. |
18:59
π
|
SadDM |
yeah... I guess I was just thinking that it would be big enough to warrant a group project |
22:16
π
|
wp494 |
re. what ATZ0 brought up: time to unleash archivebot on it |
22:16
π
|
wp494 |
in other news: !!! |
22:16
π
|
wp494 |
http://www.cbc.ca/news/business/yahoo-buys-snapchat-rival-blink-1.2642954 |
22:20
π
|
ATZ0 |
there's some flash audio players involved, not sure how that complicates things. |
22:32
π
|
SadDM |
I'm in the process of tracking down the urls of all of the mp3s, but it's going to take a while. |
22:33
π
|
SadDM |
Even most of the content on the pages is JS generated, so archivebot isn't going to get much |
22:57
π
|
Baljem |
I thought Yahoo were already a Snapchat rival? you upload your precious data, and a few years later they delete it... |
23:00
π
|
garyrh |
yes, but now they'll delete it much faster. |
23:11
π
|
exmic |
hah |
23:12
π
|
SadDM |
I'm going to try downloading the first 252 East Village Radio shows yo see if I get banned... we can go from there. |
23:24
π
|
ATZ0 |
all i ask is that if we unleash warrior on it, the project be called Village People |
23:24
π
|
ATZ0 |
we want you, we want you, we want you as a new virtual machine automated website archiving recruit. |
23:37
π
|
Baljem |
I'm not /totally/ convinced that quite fits the rhythm, but apart from that... |
23:44
π
|
SadDM |
aw... I thought that EVR is staffed by terrible hipsters, we could do something like PBR. |
23:45
π
|
SadDM |
Anyway, EVR is going to be huge. My back of the envelope math puts it at about 1.25TB |
23:47
π
|
SadDM |
Is there anybody that could talk to me about setting up a warrior project? |