Time |
Nickname |
Message |
00:05
🔗
|
|
zenguy_pc has joined #archiveteam |
00:14
🔗
|
|
vOYtEC has joined #archiveteam |
00:20
🔗
|
|
phuzion has quit IRC (Read error: Operation timed out) |
00:39
🔗
|
arkiver |
Atluxity: they are needed once we have an rsync for screenr |
00:39
🔗
|
arkiver |
I'll then start the project |
00:50
🔗
|
|
cvb has quit IRC (Ping timeout: 255 seconds) |
01:09
🔗
|
|
Selanda has quit IRC (Read error: Operation timed out) |
01:18
🔗
|
|
Selanda has joined #archiveteam |
01:50
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
01:51
🔗
|
|
BlueMaxim has joined #archiveteam |
01:52
🔗
|
|
zenguy_pc has quit IRC (Read error: Operation timed out) |
01:56
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
02:01
🔗
|
|
schbirid2 has joined #archiveteam |
02:02
🔗
|
|
BlueMaxim has joined #archiveteam |
02:03
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
02:05
🔗
|
|
zenguy_pc has joined #archiveteam |
02:05
🔗
|
|
primus105 has quit IRC (Leaving.) |
02:11
🔗
|
|
phuzion has joined #archiveteam |
02:15
🔗
|
chfoo |
arkiver: ok, done |
02:24
🔗
|
SketchCow |
I don't know what I am doing |
02:24
🔗
|
SketchCow |
chfoo: how is it done |
02:25
🔗
|
chfoo |
i just create the directory for the project in my warrior directory and then add the url of the new path in the tracker |
02:59
🔗
|
|
username1 has joined #archiveteam |
03:01
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
03:08
🔗
|
SketchCow |
Got it |
03:08
🔗
|
SketchCow |
Will do going forward |
03:09
🔗
|
SketchCow |
If you're not doing it, or they hit me first |
03:09
🔗
|
SketchCow |
I actually can't run the tracker |
03:09
🔗
|
SketchCow |
ha |
03:29
🔗
|
|
bwn__ has quit IRC (Ping timeout: 252 seconds) |
03:33
🔗
|
|
Atom-- has joined #archiveteam |
03:38
🔗
|
|
Atom__ has quit IRC (Ping timeout: 506 seconds) |
03:51
🔗
|
|
zenguy_pc has quit IRC (Read error: Operation timed out) |
04:02
🔗
|
|
chazchaz has quit IRC (ny.us.hub irc.umich.edu) |
04:02
🔗
|
|
jspiros has quit IRC (ny.us.hub irc.umich.edu) |
04:02
🔗
|
|
trs80 has quit IRC (ny.us.hub irc.umich.edu) |
04:09
🔗
|
|
atlogbot has quit IRC (Read error: Operation timed out) |
04:10
🔗
|
|
slyphic has quit IRC (Read error: Operation timed out) |
04:15
🔗
|
|
chazchaz has joined #archiveteam |
04:15
🔗
|
|
jspiros has joined #archiveteam |
04:15
🔗
|
|
trs80 has joined #archiveteam |
04:16
🔗
|
|
atlogbot has joined #archiveteam |
04:16
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
04:17
🔗
|
|
BlueMaxim has joined #archiveteam |
04:17
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
04:23
🔗
|
|
BlueMaxim has joined #archiveteam |
04:24
🔗
|
|
slyphic has joined #archiveteam |
04:27
🔗
|
|
chazchaz has quit IRC (ny.us.hub irc.umich.edu) |
04:27
🔗
|
|
jspiros has quit IRC (ny.us.hub irc.umich.edu) |
04:27
🔗
|
|
trs80 has quit IRC (ny.us.hub irc.umich.edu) |
04:30
🔗
|
|
RedType has quit IRC (Read error: Operation timed out) |
04:39
🔗
|
|
xk_id has quit IRC (Remote host closed the connection) |
04:41
🔗
|
|
RedType has joined #archiveteam |
05:01
🔗
|
|
rentrin has joined #archiveteam |
05:04
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
05:10
🔗
|
|
rentrin has quit IRC (Read error: Connection reset by peer) |
05:10
🔗
|
|
rentrin has joined #archiveteam |
05:15
🔗
|
|
rentrin has quit IRC (Read error: Operation timed out) |
05:18
🔗
|
|
chazchaz has joined #archiveteam |
05:18
🔗
|
|
jspiros has joined #archiveteam |
05:18
🔗
|
|
trs80 has joined #archiveteam |
05:26
🔗
|
|
Smiley has joined #archiveteam |
05:27
🔗
|
|
SmileyG has quit IRC (Read error: Operation timed out) |
05:28
🔗
|
|
Barry has quit IRC (Read error: Operation timed out) |
05:28
🔗
|
|
Barry has joined #archiveteam |
05:29
🔗
|
|
lytv has quit IRC (Read error: Operation timed out) |
05:31
🔗
|
|
RedType_ has joined #archiveteam |
05:31
🔗
|
|
rduser has quit IRC (Ping timeout: 499 seconds) |
05:31
🔗
|
|
rduser has joined #archiveteam |
05:33
🔗
|
|
RedType has quit IRC (Ping timeout: 499 seconds) |
05:49
🔗
|
|
WinterFox has joined #archiveteam |
06:13
🔗
|
|
remsen has quit IRC (Read error: Connection reset by peer) |
06:18
🔗
|
|
primus104 has joined #archiveteam |
06:26
🔗
|
|
vitzli has joined #archiveteam |
06:33
🔗
|
|
primus104 has quit IRC (Leaving.) |
06:33
🔗
|
|
Atom__ has joined #archiveteam |
06:35
🔗
|
|
atlogbot has quit IRC (Ping timeout: 369 seconds) |
06:36
🔗
|
|
slyphic has quit IRC (Read error: Operation timed out) |
06:37
🔗
|
|
Elegance_ has quit IRC (Read error: Operation timed out) |
06:40
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
06:40
🔗
|
|
BlueMaxim has joined #archiveteam |
06:40
🔗
|
|
atlogbot has joined #archiveteam |
06:41
🔗
|
|
slyphic has joined #archiveteam |
06:41
🔗
|
|
bwn has joined #archiveteam |
06:41
🔗
|
|
Atom-- has quit IRC (hub.se irc.underworld.no) |
06:41
🔗
|
|
Atluxity has quit IRC (hub.se irc.underworld.no) |
06:41
🔗
|
|
Rye has quit IRC (hub.se irc.underworld.no) |
06:51
🔗
|
|
Smiley has quit IRC (Read error: Operation timed out) |
07:10
🔗
|
|
remsen has joined #archiveteam |
07:16
🔗
|
SketchCow |
https://archive.org/download/BostonComputerSociety_Master_PC_Disks_01 |
07:18
🔗
|
SketchCow |
Wiki uploads are caught up |
07:19
🔗
|
|
cvb has joined #archiveteam |
07:19
🔗
|
JesseW |
Awesome! |
07:20
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
07:21
🔗
|
SketchCow |
Gamefront is still going on, but it's 3tb |
07:29
🔗
|
|
Deewiant has joined #archiveteam |
07:42
🔗
|
|
Rye has joined #archiveteam |
08:13
🔗
|
|
Fletcher has joined #archiveteam |
08:13
🔗
|
|
diacope has joined #archiveteam |
08:13
🔗
|
|
JesseW has quit IRC (Leaving.) |
08:14
🔗
|
|
Nemo_bis has joined #archiveteam |
08:15
🔗
|
|
Smiley has joined #archiveteam |
08:39
🔗
|
|
atomotic has joined #archiveteam |
08:40
🔗
|
|
primus104 has joined #archiveteam |
08:41
🔗
|
|
bwn_ has joined #archiveteam |
08:45
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
09:05
🔗
|
|
remsen has quit IRC (Read error: Connection reset by peer) |
09:08
🔗
|
arkiver |
chfoo: thanks |
09:08
🔗
|
arkiver |
screenr project is started! |
09:12
🔗
|
|
terburg has joined #archiveteam |
09:21
🔗
|
arkiver |
scripts for screenr updated. |
09:25
🔗
|
|
bwn_ is now known as bwn |
09:26
🔗
|
|
brayden has joined #archiveteam |
09:27
🔗
|
|
Darkstar has quit IRC (Ping timeout: 1729 seconds) |
10:11
🔗
|
|
terburg has quit IRC (Quit: terburg) |
10:13
🔗
|
|
cvb has quit IRC (Quit: Leaving) |
10:20
🔗
|
|
bzc6p__ has joined #archiveteam |
10:26
🔗
|
|
bzc6p_ has quit IRC (Ping timeout: 615 seconds) |
10:27
🔗
|
|
vOYtEC has quit IRC (Read error: Connection reset by peer) |
10:36
🔗
|
|
vOYtEC has joined #archiveteam |
10:37
🔗
|
|
cvb has joined #archiveteam |
10:53
🔗
|
|
Darkstar has joined #archiveteam |
10:54
🔗
|
|
terburg has joined #archiveteam |
10:55
🔗
|
|
Atluxity has joined #archiveteam |
10:57
🔗
|
|
Ungstein has joined #archiveteam |
11:02
🔗
|
|
VADemon has joined #archiveteam |
11:06
🔗
|
|
arkiver2 has joined #archiveteam |
11:09
🔗
|
|
xk_id has joined #archiveteam |
11:19
🔗
|
|
Ungstein has quit IRC (Ping timeout: 252 seconds) |
11:23
🔗
|
|
Ungstein has joined #archiveteam |
11:41
🔗
|
|
terburg has quit IRC (Quit: terburg) |
11:43
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
11:48
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
11:52
🔗
|
|
primus104 has quit IRC (Leaving.) |
11:54
🔗
|
|
zenguy_pc has joined #archiveteam |
12:10
🔗
|
|
vitzli has joined #archiveteam |
12:10
🔗
|
|
dtm has joined #archiveteam |
12:19
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
12:21
🔗
|
|
dserodio has joined #archiveteam |
12:36
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
12:39
🔗
|
|
remsen has joined #archiveteam |
12:41
🔗
|
|
atomotic has joined #archiveteam |
12:49
🔗
|
|
chazchaz has quit IRC (Ping timeout: 186 seconds) |
12:50
🔗
|
|
chazchaz has joined #archiveteam |
12:57
🔗
|
|
Ungstein1 has joined #archiveteam |
12:59
🔗
|
|
Ungstein has quit IRC (Ping timeout: 252 seconds) |
13:05
🔗
|
|
terburg has joined #archiveteam |
13:09
🔗
|
|
Ymgve has quit IRC (Ping timeout: 506 seconds) |
13:16
🔗
|
|
bwn has joined #archiveteam |
13:26
🔗
|
|
terburg has quit IRC (Quit: terburg) |
13:44
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
13:56
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
13:59
🔗
|
phuzion |
Here's an Ansible playbook I wrote to configure CentOS 6.7 machines to run the screenr grab: https://gist.github.com/anonymous/3cacc8b886daf10a7698 |
13:59
🔗
|
|
dashcloud has joined #archiveteam |
14:04
🔗
|
|
scyther has joined #archiveteam |
14:08
🔗
|
phuzion |
It'll probably work on other similar-ish versions of RH based OSes, but I'm not promising anything except that it'll work on CentOS 6.7. |
14:09
🔗
|
|
Elegance has joined #archiveteam |
14:09
🔗
|
|
Elegance has quit IRC (Client Quit) |
14:17
🔗
|
|
Ungstein1 has quit IRC (Quit: Leaving.) |
14:17
🔗
|
|
Ungstein has joined #archiveteam |
14:17
🔗
|
|
nertzy has joined #archiveteam |
14:37
🔗
|
|
Elegance has joined #archiveteam |
14:42
🔗
|
|
VADemon has quit IRC (left4dead) |
14:48
🔗
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
14:49
🔗
|
|
Meeh_ has quit IRC (Quit: No Ping reply in 180 seconds.) |
14:49
🔗
|
|
Meeh has joined #archiveteam |
15:06
🔗
|
|
primus104 has joined #archiveteam |
15:17
🔗
|
|
terburg has joined #archiveteam |
15:29
🔗
|
|
primus104 has quit IRC (Leaving.) |
15:39
🔗
|
|
zenguy_pc has quit IRC (Ping timeout: 252 seconds) |
15:45
🔗
|
|
Darkstar has quit IRC (Ping timeout: 252 seconds) |
15:48
🔗
|
|
scyther has quit IRC (Read error: Connection reset by peer) |
15:49
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
16:07
🔗
|
|
Darkstar has joined #archiveteam |
16:23
🔗
|
|
terburg has quit IRC (Quit: terburg) |
16:25
🔗
|
|
nertzy has joined #archiveteam |
16:29
🔗
|
|
Xena has joined #archiveteam |
16:29
🔗
|
Xena |
Is there a place I can download all of the vaporwave.me audio files? |
16:31
🔗
|
|
bzc6p__ is now known as bzc6p |
16:32
🔗
|
phuzion |
Xena: https://web.archive.org/web/20150219214318/http://vaporwave.me/ is the wayback machine link |
16:32
🔗
|
phuzion |
I'm not 100% sure if it's possible to easily bulk-grab the files that archivebot snagged |
16:32
🔗
|
Xena |
yeah, hmm |
16:34
🔗
|
bzc6p |
Xena: You can try two things. |
16:35
🔗
|
bzc6p |
1. Go to archive.org, and in the search bar (not the Wayback search) enter "vaporwave" and see what you find. |
16:35
🔗
|
bzc6p |
2. http://web.archive.org/web/*/vaporwave.me/* and then filter to filetype. |
16:36
🔗
|
Xena |
is the internet archive okay with me scraping the HTML of things to download files? |
16:36
🔗
|
Xena |
there's about 200 GB of music i'm trying to fetch out |
16:37
🔗
|
bzc6p |
I've a better idea. |
16:39
🔗
|
|
Ymgve has joined #archiveteam |
16:40
🔗
|
bzc6p |
Use the CDX server to generate a filelist, then get'em all. |
16:40
🔗
|
Xena |
CDX? |
16:40
🔗
|
bzc6p |
https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server |
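The CDX suggestion above can be sketched in Python. This is a minimal sketch, not a tested downloader: it only builds a query URL for the public Wayback CDX endpoint and turns a JSON response into a list of capture URLs. The helper names (`build_cdx_query`, `rows_to_wayback_urls`) and the `audio/.*` MIME filter are assumptions made for illustration; the query parameters (`url`, `matchType`, `output`, `fl`, `filter`, `collapse`) are the ones described in the wayback-cdx-server README linked above.

```python
from urllib.parse import urlencode

# Public CDX endpoint of the Wayback Machine.
CDX_ENDPOINT = "http://web.archive.org/cdx/search/cdx"


def build_cdx_query(domain, mime_regex="audio/.*"):
    """Build a CDX query URL listing captured files for a whole domain.

    The MIME regex is an assumption; adjust it to whatever the site served.
    """
    params = [
        ("url", domain),
        ("matchType", "domain"),           # include subdomains and all paths
        ("output", "json"),                # first row of the reply is the field list
        ("fl", "timestamp,original"),      # only the fields we need
        ("filter", "statuscode:200"),      # skip redirects and errors
        ("filter", "mimetype:" + mime_regex),
        ("collapse", "urlkey"),            # one capture per distinct URL
    ]
    return CDX_ENDPOINT + "?" + urlencode(params)


def rows_to_wayback_urls(rows):
    """Convert parsed CDX JSON rows into raw-content Wayback URLs.

    The "id_" suffix asks the Wayback Machine for the original bytes
    without its rewritten links and toolbar.
    """
    if not rows:
        return []
    header, *records = rows
    ts = header.index("timestamp")
    orig = header.index("original")
    return [
        "http://web.archive.org/web/%sid_/%s" % (rec[ts], rec[orig])
        for rec in records
    ]
```

Fetching `build_cdx_query("vaporwave.me")`, parsing the body with `json.loads`, and passing the result to `rows_to_wayback_urls` would give a file list that can be handed to a downloader. For a grab of around 200 GB, throttling the downloads is the polite thing to do.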
16:40
🔗
|
DFJustin |
the archivebot grabs are linked from http://archive.fart.website/archivebot/viewer/domain/vaporwave.me |
16:40
🔗
|
DFJustin |
you can get the warcs and run warctozip on them |
16:41
🔗
|
bzc6p |
^ Even better, if most of the content is the music. |
16:41
🔗
|
Xena |
where can I get warctozip? |
16:41
🔗
|
phuzion |
DFJustin: Are WARCs kept on IA permanently? |
16:42
🔗
|
phuzion |
Or are they removed after being ingested into the wayback machine? |
16:42
🔗
|
DFJustin |
permanently |
16:42
🔗
|
phuzion |
Nice to know. |
16:42
🔗
|
DFJustin |
the wayback machine serves from the warcs |
16:42
🔗
|
Xena |
is there a torrent download for warcs? |
16:43
🔗
|
DFJustin |
http://www.archiveteam.org/index.php?title=WARC |
16:43
🔗
|
DFJustin |
the archivebot go items should have torrents but you'd have to do some legwork to get all the parts |
16:44
🔗
|
DFJustin |
e.g. https://archive.org/download/archiveteam_archivebot_go_20150127000001/archiveteam_archivebot_go_20150127000001_archive.torrent has a bunch of them |
16:44
🔗
|
DFJustin |
but a bunch of other stuff too |
16:44
🔗
|
MrRadar |
I've got a complete copy of the vaporwave.me SFTP site (I grabbed it for personal use about a month before the site stopped responding), should I upload these files to the IA? It should just be duplicates of material saved from their HTTP site |
16:44
🔗
|
bzc6p |
phuzion: yes, even the Wayback content is read from 1 GB warcs. Once I was able to locate the corresponding items, but the files themselves are hidden. |
16:45
🔗
|
Xena |
https://github.com/alard/warctozip-service |
16:45
🔗
|
Xena |
is this the one I want? |
16:45
🔗
|
DFJustin |
MrRadar: our grabs were in january, would there have been new content between then and your grab |
16:46
🔗
|
MrRadar |
Probably not, though I'll double-check the modification timestamps on those files |
16:48
🔗
|
MrRadar |
Looks like the mtimes for all those files are the local time I downloaded them, not the mtime from the server |
16:49
🔗
|
MrRadar |
I don't think the site owner accepted any uploads after they posted the notice they weren't going to update until after summer |
16:49
🔗
|
MrRadar |
So I don't expect I have any unique content |
16:50
🔗
|
bzc6p |
Xena: It seems. |
16:50
🔗
|
bzc6p |
I've never tried it, though. |
16:51
🔗
|
bzc6p |
http://warctozip.archive.org/ |
16:55
🔗
|
bzc6p |
It seems to have a feature that you don't need to download the file, it fetches and converts itself. |
16:55
🔗
|
bzc6p |
However, it gives me Internal Server Error on warc.gz ... |
16:56
🔗
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
16:58
🔗
|
bzc6p |
You can try the other methods, though. Preferably first with a small file to see if it works. Have fun. |
17:00
🔗
|
|
JesseW has joined #archiveteam |
17:22
🔗
|
|
JesseW has quit IRC (Leaving.) |
17:41
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
17:41
🔗
|
SketchCow |
We need to fix it |
17:43
🔗
|
|
jspiros has quit IRC (hub.efnet.us irc.umich.edu) |
17:43
🔗
|
|
trs80 has quit IRC (hub.efnet.us irc.umich.edu) |
17:56
🔗
|
phuzion |
Is there a channel for screenr or are we just sticking around here for it? |
17:58
🔗
|
|
primus104 has joined #archiveteam |
18:06
🔗
|
Atluxity |
can I haz access to https://github.com/ArchiveTeam ? same username as nick here |
18:08
🔗
|
|
scyther has joined #archiveteam |
18:09
🔗
|
yipdw |
Atluxity: what level |
18:09
🔗
|
yipdw |
or rather, what teams |
18:11
🔗
|
Atluxity |
I dont know what teams there are... I just felt like getting my feet wet and creating some small project from scratch by myself |
18:12
🔗
|
arkiver |
What project? |
18:12
🔗
|
yipdw |
github membership is based on teams, so you'll need that first |
18:12
🔗
|
bzc6p |
phuzion: none that I know of. |
18:13
🔗
|
Atluxity |
arkiver: not sure, look at the list of proposed projects |
18:13
🔗
|
arkiver |
docstoc is already being worked on |
18:13
🔗
|
|
jspiros has joined #archiveteam |
18:13
🔗
|
|
trs80 has joined #archiveteam |
18:13
🔗
|
arkiver |
as for the others, I don't know if we're going to do a project for those |
18:14
🔗
|
Atluxity |
how about flickr? :) |
18:14
🔗
|
yipdw |
Atluxity: you can always create a repo and then request project transfer |
18:15
🔗
|
arkiver |
no. |
18:15
🔗
|
arkiver |
we're not just going to grab big websites |
18:15
🔗
|
arkiver |
So no flickr |
18:15
🔗
|
arkiver |
if they're shutting down, we'll do them |
18:15
🔗
|
Atluxity |
yipdw: ah, sure, yeah, did not know you could do that |
18:15
🔗
|
arkiver |
but not now |
18:16
🔗
|
arkiver |
^ that was still about flickr |
18:16
🔗
|
phuzion |
Yeah, Flickr would be hundreds, if not thousands of TBs. |
18:16
🔗
|
|
jspiros has quit IRC (hub.efnet.us irc.umich.edu) |
18:16
🔗
|
|
trs80 has quit IRC (hub.efnet.us irc.umich.edu) |
18:16
🔗
|
bzc6p |
What's the status on Blogger? |
18:16
🔗
|
arkiver |
Yes |
18:16
🔗
|
arkiver |
We have the full list of websites (still need to be processed) |
18:16
🔗
|
yipdw |
hmm, Github changed the organization authorization thing a bit |
18:16
🔗
|
yipdw |
now we have "roles" |
18:17
🔗
|
yipdw |
maybe that |
18:17
🔗
|
yipdw |
will be more flexible than the old repo/group-based system |
18:17
🔗
|
|
Start has quit IRC (Read error: Connection reset by peer) |
18:17
🔗
|
arkiver |
Google stopped with their Blogger ideas, so for now other websites have a higher priority |
18:18
🔗
|
yipdw |
oh, yes, there is a way |
18:18
🔗
|
|
Start has joined #archiveteam |
18:18
🔗
|
yipdw |
Atluxity: I'll see if I can enable repo creation for all members without allowing push-to-any-repo |
18:18
🔗
|
yipdw |
we currently have some users in the admins group that should not be in the admins group |
18:18
🔗
|
yipdw |
they really just need the ability to just create repos |
18:22
🔗
|
|
lytv has joined #archiveteam |
18:23
🔗
|
|
remsen has quit IRC (Read error: Operation timed out) |
18:33
🔗
|
|
kbs1 has joined #archiveteam |
18:33
🔗
|
kbs1 |
hello, does anybody have google video docid -1820933916457441927? thanks a million!:) |
18:35
🔗
|
arkiver |
SketchCow: working on scoop now |
18:38
🔗
|
phuzion |
kbs1: Do you have any more metadata about that item other than its id number? title by chance? |
18:38
🔗
|
SketchCow |
Great |
18:38
🔗
|
kbs1 |
phuzion: Yes, title is "Hycko v Inkognite" |
18:39
🔗
|
phuzion |
kbs1: Unfortunately, I can't find anything on IA based on those two search terms, perhaps someone else knows how to find what you're looking for. |
18:40
🔗
|
SimpBrain |
rsync is maxing out on screenr i think |
18:40
🔗
|
kbs1 |
phuzion: thanks, I'll wait to see if anyone else has any more suggestions |
18:41
🔗
|
arkiver |
SimpBrain: gamefront paused. |
18:41
🔗
|
kbs1 |
phuzion: if it helps, archive.org link is https://web.archive.org/web/20110707000518/http://video.google.com/videoplay?docid=-1820933916457441927 but of course the video doesn't play |
18:46
🔗
|
Atluxity |
we are go for screenr? |
18:46
🔗
|
arkiver |
yeah, we started yesterday |
18:47
🔗
|
arkiver |
will be ~1 TB |
18:47
🔗
|
phuzion |
arkiver: are we on a tight deadline? |
18:48
🔗
|
arkiver |
yes, november 11th |
18:48
🔗
|
arkiver |
but we'll make it I think |
18:48
🔗
|
arkiver |
though these are only videos, html will be grabbed later |
18:48
🔗
|
phuzion |
I can easily double or triple the amount of instances I'm throwing at it, should I? |
18:48
🔗
|
arkiver |
currently we're having more requests for items than we are handing out |
18:49
🔗
|
arkiver |
so currently we're fine |
18:49
🔗
|
phuzion |
Ok |
18:49
🔗
|
phuzion |
FOS is having a tough time keeping up with the rsync requests, right? So we shouldn't bother turning up the speed on the tracker? |
18:50
🔗
|
arkiver |
I paused gamefront, so we should be fine on that |
18:50
🔗
|
SketchCow |
Do we use phantomjs in anything? |
18:50
🔗
|
arkiver |
no |
18:51
🔗
|
SketchCow |
Just checking. Thanks |
18:51
🔗
|
arkiver |
well, yes, archivebot |
18:51
🔗
|
SketchCow |
That's something |
18:51
🔗
|
arkiver |
ok, sorry, in archivebot, not in any warrior projects |
18:51
🔗
|
arkiver |
(yet) |
18:57
🔗
|
|
kbs1 has quit IRC (Remote host closed the connection) |
19:31
🔗
|
bzc6p |
arkiver: Is it only my understanding that stuff will remain online after the 11th on Screenr? |
19:31
🔗
|
arkiver |
you're right |
19:31
🔗
|
arkiver |
"If you'd like to download your videos, please do so before November 11th. After then, you will no longer be able to log into Screenr. However, all of your embedded videos and links to your videos will continue to work." |
19:33
🔗
|
Atluxity |
I added 300 instances to screenr (10 x 30 hosts) |
19:34
🔗
|
|
sivoais has joined #archiveteam |
19:34
🔗
|
bzc6p |
Atluxity: we should be glad that there are more people who want to participate in project coding. We need such people the most. Take it as a personal thank you. – I'm going to be another such person, but I've got too much stuff to do this year on. Maybe later. |
19:56
🔗
|
|
aaaaaaaaa has joined #archiveteam |
19:56
🔗
|
SketchCow |
Trust nothing with Screenr |
20:08
🔗
|
|
username1 has quit IRC (Quit: Leaving) |
20:15
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
20:21
🔗
|
|
bzc6p_ has joined #archiveteam |
20:22
🔗
|
|
ex-parrot has joined #archiveteam |
20:23
🔗
|
ex-parrot |
morning everyone, looks like oldfriends.co.nz is going to be wiped in January if anyone is interested in scraping it: http://www.stuff.co.nz/business/73855525/old-friends-announces-closure |
20:23
🔗
|
ex-parrot |
I am fairly sure it would be the work of a couple of days for one person to scrape the whole thing... |
20:24
🔗
|
phuzion |
Perhaps it could be a job for archivebot? |
20:24
🔗
|
ex-parrot |
(and with that I have probably volunteered myself) |
20:25
🔗
|
ex-parrot |
to get all the data the scraper will need to "log in", but I think that's about all that is required |
20:25
🔗
|
phuzion |
That...complicates things. |
20:25
🔗
|
phuzion |
By how much, I have not the slightest clue, but it definitely complicates things. |
20:26
🔗
|
ex-parrot |
indeed. the codebase is ancient though (doesn't even HTTPS) so it'd probably be sufficient just to run a normal spider with a copy of a valid session cookie |
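The "normal spider with a copy of a valid session cookie" idea can be sketched with the Python standard library. This is a rough sketch under stated assumptions, not the scraper anyone in the channel actually ran: the cookie name and value, the User-Agent string, and the helper names are all hypothetical, and a real grab would go through wget or ArchiveBot so that the output is WARC.

```python
from urllib.parse import urljoin, urlparse
from urllib.request import Request

# Hypothetical placeholder: copy the real cookie from a logged-in browser session.
SESSION_COOKIE = "SESSIONID=replace-with-a-real-value"


def build_request(url, cookie=SESSION_COOKIE):
    """Prepare a request that presents the borrowed session cookie."""
    return Request(
        url,
        headers={
            "Cookie": cookie,
            "User-Agent": "example-spider/0.1",  # hypothetical identifier
        },
    )


def same_site_links(base_url, hrefs):
    """Resolve hrefs against base_url and keep only links on the same host.

    Order is preserved and duplicates are dropped, so the result can be
    appended to a crawl queue directly.
    """
    host = urlparse(base_url).netloc
    kept = []
    for href in hrefs:
        absolute = urljoin(base_url, href)
        if urlparse(absolute).netloc == host and absolute not in kept:
            kept.append(absolute)
    return kept
```

Passing `build_request(url)` to `urllib.request.urlopen` would fetch a page as the logged-in user. The session will expire at some point, so a long crawl should check for a redirect back to the login page.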
20:26
🔗
|
|
aaaaaaaaa has quit IRC (Read error: Connection reset by peer) |
20:27
🔗
|
phuzion |
Wow, we are screaming through screenr. Are there other items to be released down the road? |
20:30
🔗
|
|
bzc6p has quit IRC (Read error: Operation timed out) |
20:30
🔗
|
|
pgoetz has joined #archiveteam |
20:31
🔗
|
pgoetz |
I'm looking for something that will allow me to store digital records offline on a medium that will be stable over, say 10 years. |
20:32
🔗
|
|
superkuh has quit IRC (Read error: Connection reset by peer) |
20:32
🔗
|
achip |
ex-parrot, phuzion: we can put the session cookie in an archivebot pipeline similar to the "cookie-hax/internet-centrum" branch |
20:32
🔗
|
xmc |
pgoetz: https://en.wikipedia.org/wiki/Punched_tape |
20:32
🔗
|
pgoetz |
does anyone have any thoughts on/experience with this product? https://pro.sony.com/bbsc/ssr/mkt-nab2015/mkt-nab2015mediaworkflow/product-ODSD77U/ |
20:33
🔗
|
ex-parrot |
achip: sounds promising. I am heading away for the rest of the week, but I'll be back to help organise something after that |
20:33
🔗
|
achip |
sounds good, I'll poke around and I'm sure others will also have suggestions |
20:37
🔗
|
Atluxity |
bzc6p_: appreciate it :) |
20:38
🔗
|
|
superkuh has joined #archiveteam |
20:40
🔗
|
|
godane has quit IRC (Ping timeout: 310 seconds) |
20:45
🔗
|
|
bwn has joined #archiveteam |
20:45
🔗
|
|
SmileyG has joined #archiveteam |
20:48
🔗
|
|
vtyl has joined #archiveteam |
20:51
🔗
|
phuzion |
Atluxity: how many threads do you have going in total on screenr? 300? |
20:52
🔗
|
|
lytv has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
Ungstein has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
Smiley has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
Fletcher has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
diacope has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
Nemo_bis has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
Xena has quit IRC (hub.se efnet.port80.se) |
20:52
🔗
|
|
bzc6p_ is now known as bzc6p |
20:54
🔗
|
bzc6p |
pgoetz: http://ollydbg.de/Paperbak |
20:54
🔗
|
|
Ghost_of_ has joined #archiveteam |
20:55
🔗
|
bzc6p |
However, I'd use HDDs or better quality DVDs, regularly checked. Their high capacity is worth that little effort. |
20:55
🔗
|
Atluxity |
phuzion: yes. 10 instances on 30hosts, each has its own ip. 1cpu, some gig ram, 15gb hdd |
20:56
🔗
|
bzc6p |
Or there is something called m-disc, said to keep data for 1000 years. But that's not really tested, I guess. |
20:56
🔗
|
phuzion |
Atluxity: Interesting, I'm running 20x20, and you're catching up to me REALLY quick. |
20:58
🔗
|
pgoetz |
bzc6p: interesting. It says you can paper record 500K per sheet of paper. We have about 12TB of stuff currently. That would be 24,000,000 sheets of paper. |
20:58
🔗
|
|
godane has joined #archiveteam |
21:01
🔗
|
bzc6p |
Or 3 HDDs. Choose wisely. |
21:01
🔗
|
|
Fletcher_ has joined #archiveteam |
21:02
🔗
|
pgoetz |
bzc6p: or a stack of paper 2.4km tall. HDDs are unstable when unplugged for more than about 6 months -- the bits start to flip. DVD is kind of what the Sony does; i.e. optical. The problem is a single DVD only holds about 9.4G max. |
21:02
🔗
|
pgoetz |
We tried using double sided blu-rays, but that resulted in stacks and stacks of disks. |
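The paper-versus-disk figures quoted above can be checked with a few lines of arithmetic. This is a sketch under stated assumptions: decimal units (1 TB = 10^12 bytes, 500K = 500,000 bytes), a sheet thickness of 0.1 mm, and 4 TB drives. The thickness and drive size are not stated in the conversation; they are the values that reproduce the "2.4km" and "3 HDDs" figures.

```python
# Figures from the conversation.
ARCHIVE_BYTES = 12 * 10**12        # "about 12TB of stuff"
BYTES_PER_SHEET = 500 * 10**3      # "500K per sheet of paper"

# Assumptions, not stated in the log.
SHEET_THICKNESS_UM = 100           # 0.1 mm per sheet, in micrometres
DRIVE_BYTES = 4 * 10**12           # 4 TB per hard drive

# Number of sheets needed to hold the archive.
sheets = ARCHIVE_BYTES // BYTES_PER_SHEET

# Height of the stack: micrometres to metres, then to kilometres.
stack_m = sheets * SHEET_THICKNESS_UM / 1_000_000
stack_km = stack_m / 1000

# Ceiling division: how many drives are needed for the same data.
drives = -(-ARCHIVE_BYTES // DRIVE_BYTES)

print(sheets, stack_km, drives)
```

Under these assumptions the numbers come out to 24,000,000 sheets, a 2.4 km stack, and 3 drives, which matches what was said in the channel.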
21:02
🔗
|
phuzion |
pgoetz: how much data are you trying to offline-archive? |
21:04
🔗
|
Atluxity |
phuzion: that is interesting... I don't think I would be any faster with more concurrents |
21:05
🔗
|
phuzion |
Atluxity: that's what I'm wondering, if I'm not getting as much out of the 20 concurrents as I think I am |
21:05
🔗
|
phuzion |
I'm wondering if I'd get better performance by doing more instances at say 10 or 15 concurrent instances. |
21:06
🔗
|
bzc6p |
I think it's been discussed and stated that HDDs are the most compact and efficient way of storing, especially online, but even offline, with some data-refreshing sometimes. With that much data, I think there is no other viable option. |
21:06
🔗
|
phuzion |
Oh, I missed the 12TB part. |
21:06
🔗
|
bzc6p |
Check out: http://archiveteam.org/index.php?title=Valhalla |
21:07
🔗
|
bzc6p |
(this conversation should be moved to #archiveteam-bs if to be continued. – I won't follow, though.) |
21:09
🔗
|
xmc |
yes, it's offtopic here. |
21:15
🔗
|
pgoetz |
Thanks for letting me know, I had no idea. BTW, what is the specific purpose of this channel? |
21:18
🔗
|
phuzion |
pgoetz: to discuss archiveteam projects and coordinate for them. Offtopic stuff goes in #archiveteam-bs |
21:20
🔗
|
pgoetz |
Thanks for the clarification. I was given this channel by a colleague, and will take this issue up on #archiveteam-bs, although I remain fairly convinced that the Sony optical cartridge is the only viable game in town right now. |
21:31
🔗
|
|
zenguy_pc has joined #archiveteam |
21:32
🔗
|
|
Fletcher_ is now known as fletcher |
21:32
🔗
|
|
fletcher is now known as Fletcher |
21:39
🔗
|
|
scyther has quit IRC (Quit: Leaving) |
21:50
🔗
|
SketchCow |
What's the discussion |
21:57
🔗
|
SketchCow |
------------------------------------------------------ |
21:57
🔗
|
SketchCow |
http://www.oldfriends.co.nz/ |
21:57
🔗
|
SketchCow |
Project: #nofriends |
21:57
🔗
|
SketchCow |
------------------------------------------------------ |
22:05
🔗
|
|
slyphic is now known as slyphic|a |
22:22
🔗
|
SketchCow |
godane: https://kpfa.org/programs/ |
22:23
🔗
|
SketchCow |
If you were to start grabbing the "podcast" shows, it would be great. |
22:25
🔗
|
|
anomie has quit IRC (Read error: Connection reset by peer) |
22:25
🔗
|
godane |
looks like i may be able to go 2 ways with kpfa |
22:25
🔗
|
godane |
one is just a dump of everything on a day |
22:35
🔗
|
godane |
SketchCow: they limit the bandwidth to 60k |
22:36
🔗
|
|
Emcy has joined #archiveteam |
22:36
🔗
|
|
bzc6p_ has joined #archiveteam |
22:36
🔗
|
|
anomie has joined #archiveteam |
22:40
🔗
|
Start |
should we grab thisismyjam.com soon? |
22:41
🔗
|
Start |
it was shut down currently exists as an archive |
22:41
🔗
|
Start |
*and currently |
22:41
🔗
|
phuzion |
They've got an API which is nice. |
22:41
🔗
|
|
bzc6p has quit IRC (Read error: Operation timed out) |
22:41
🔗
|
phuzion |
Well documented, too. |
22:41
🔗
|
|
Emcy_ has quit IRC (Read error: Operation timed out) |
22:47
🔗
|
|
vOYtEC has quit IRC (Ping timeout: 255 seconds) |
22:47
🔗
|
|
chazchaz has quit IRC (Read error: Operation timed out) |
22:48
🔗
|
|
atlogbot has quit IRC (Read error: Operation timed out) |
22:48
🔗
|
|
slyphic|a has quit IRC (Read error: Operation timed out) |
22:48
🔗
|
|
cvb has quit IRC (Ping timeout: 369 seconds) |
22:49
🔗
|
|
slyphic has joined #archiveteam |
22:49
🔗
|
|
cvb has joined #archiveteam |
22:49
🔗
|
|
atlogbot has joined #archiveteam |
22:53
🔗
|
|
Ghost_of_ has quit IRC (Quit: Leaving) |
22:54
🔗
|
|
chazchaz has joined #archiveteam |
23:10
🔗
|
DFJustin |
supposedly they're going to leave it up |
23:11
🔗
|
|
aaaaaaaaa has joined #archiveteam |
23:18
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
23:28
🔗
|
|
aaaaaaaa_ has joined #archiveteam |
23:28
🔗
|
|
aaaaaaaaa has quit IRC (Read error: Connection reset by peer) |
23:29
🔗
|
|
aaaaaaaa_ is now known as aaaaaaaaa |
23:35
🔗
|
|
wednesda- has quit IRC (Ping timeout: 360 seconds) |
23:45
🔗
|
|
cvb has quit IRC (Read error: Operation timed out) |
23:53
🔗
|
|
ohhdemgir has joined #archiveteam |