| Time |
Nickname |
Message |
|
00:52
🔗
|
|
primus104 has quit IRC (Leaving.) |
|
00:57
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
|
01:04
🔗
|
|
dashcloud has joined #archiveteam-bs |
|
01:19
🔗
|
|
JesseW has joined #archiveteam-bs |
|
01:23
🔗
|
|
toad1 has joined #archiveteam-bs |
|
01:31
🔗
|
|
toad2 has quit IRC (Read error: Operation timed out) |
|
02:30
🔗
|
|
furrie has joined #archiveteam-bs |
|
02:30
🔗
|
furrie |
hi, i installed the newest grab-site today. what is the all_start_urls file all about? |
|
02:39
🔗
|
furrie |
assuming ivan can help because he helped last time |
|
02:49
🔗
|
aaaaaaaaa |
looks like it is a list of all the urls the grab starts from. |
|
02:52
🔗
|
furrie |
even totally irrelevant ones right |
|
02:52
🔗
|
furrie |
like i can add unprotected directories too |
|
02:52
🔗
|
furrie |
because that's what I want to use it for |
|
02:54
🔗
|
aaaaaaaaa |
I don't think you manually add urls to the all_start_urls. Best I can tell, that file is only written to, never read. |
|
02:55
🔗
|
furrie |
darn |
|
02:55
🔗
|
aaaaaaaaa |
if you want a list of urls you use a different file and the --input-file= argument |
|
02:56
🔗
|
furrie |
i didn't find that argument under --help |
|
02:57
🔗
|
aaaaaaaaa |
it is in the readme |
|
02:59
🔗
|
furrie |
Aha, thanks |
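A minimal sketch of the approach aaaaaaaaa describes: put the start URLs in a separate file and hand it to grab-site with `--input-file=`. The file name `start_urls.txt` and the example.com URLs are placeholders, not taken from the conversation, and the `RUN_GRAB` switch is a hypothetical guard added here so the script only prints the command unless asked to run it.

```shell
#!/bin/sh
# Hypothetical file name; any newline-delimited list of URLs should work.
url_list="start_urls.txt"

# Write one URL per line. These example.com URLs are placeholders.
printf '%s\n' \
  'http://example.com/' \
  'http://example.com/pub/' > "$url_list"

# Build the command using the --input-file= argument mentioned in the log.
cmd="grab-site --input-file=$url_list"

# RUN_GRAB is a hypothetical switch for this sketch: print by default,
# start the crawl only when RUN_GRAB=1 is set.
if [ "${RUN_GRAB:-0}" = "1" ]; then
  $cmd
else
  echo "$cmd"
fi
```

Run as-is, this only writes the list and prints the command. The grab-site README is the place to check how the option interacts with the rest of the crawl settings.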
|
03:01
🔗
|
|
furrie has quit IRC (Quit: Page closed) |
|
03:43
🔗
|
|
JesseW1 has joined #archiveteam-bs |
|
03:45
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
|
03:52
🔗
|
|
zenguy_pc has quit IRC (Read error: Connection reset by peer) |
|
03:56
🔗
|
|
sep332 has joined #archiveteam-bs |
|
04:05
🔗
|
|
JesseW1 has quit IRC (Ping timeout: 601 seconds) |
|
04:09
🔗
|
|
zenguy_pc has joined #archiveteam-bs |
|
04:11
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
|
04:33
🔗
|
|
JesseW has joined #archiveteam-bs |
|
04:47
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
|
04:51
🔗
|
yipdw |
wow, my gitlab 7.14 -> 8.0.2 upgrade went very well |
|
04:51
🔗
|
yipdw |
who the hell is on gitlab's packaging team and why are there not more of them? |
|
04:51
🔗
|
yipdw |
this is unrealistically good |
|
05:02
🔗
|
|
JesseW has joined #archiveteam-bs |
|
06:13
🔗
|
|
vitzli has joined #archiveteam-bs |
|
06:38
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
|
06:41
🔗
|
|
PurpleSym has joined #archiveteam-bs |
|
06:47
🔗
|
JesseW |
https://archive.org/stream/creativecomputing-1982-04-a/Creative_Computing_v08_n04_1982_April?ui=embed#page/n92/mode/1up <- That's a ... striking name for a technical journal... |
|
06:47
🔗
|
JesseW |
Give yourself over to ..., and it will improve your spreadsheet program! |
|
06:48
🔗
|
JesseW |
The creators of VisiCalc regularly speak through ..., don't you want to listen? |
|
07:00
🔗
|
midas |
lies yipdw, stuff needs to break just to be sure the upgrade did something |
|
07:17
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
|
07:18
🔗
|
|
primus104 has joined #archiveteam-bs |
|
07:22
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
|
08:02
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
|
08:35
🔗
|
|
kniffy has joined #archiveteam-bs |
|
08:39
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
|
08:44
🔗
|
|
kniffy has joined #archiveteam-bs |
|
08:49
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
|
08:51
🔗
|
|
kniffy has joined #archiveteam-bs |
|
09:02
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
|
09:06
🔗
|
|
schbirid has joined #archiveteam-bs |
|
09:26
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
|
09:42
🔗
|
|
primus104 has quit IRC (Leaving.) |
|
09:52
🔗
|
|
kniffy has joined #archiveteam-bs |
|
09:56
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
|
10:24
🔗
|
|
kniffy has joined #archiveteam-bs |
|
11:36
🔗
|
|
primus104 has joined #archiveteam-bs |
|
11:58
🔗
|
|
brayden has quit IRC (Ping timeout: 606 seconds) |
|
12:06
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
|
12:22
🔗
|
godane |
SketchCow: i'm watching your derbycon talk |
|
12:23
🔗
|
godane |
SketchCow: btw there was some rare artwork on AOL CDs by famous actors' kids, i think, at one point |
|
12:23
🔗
|
godane |
i know this cause it was talked about on TechTV |
|
12:23
🔗
|
godane |
when the other guy wanted 1M aol cds |
|
12:24
🔗
|
|
primus104 has quit IRC (Leaving.) |
|
12:53
🔗
|
|
kniffy has joined #archiveteam-bs |
|
12:58
🔗
|
|
SimpBrain has joined #archiveteam-bs |
|
13:06
🔗
|
|
brayden has joined #archiveteam-bs |
|
13:06
🔗
|
|
swebb sets mode: +o brayden |
|
14:18
🔗
|
|
SN4T14 has quit IRC (Ping timeout: 306 seconds) |
|
14:42
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
|
14:49
🔗
|
|
dashcloud has joined #archiveteam-bs |
|
14:58
🔗
|
|
JesseW has joined #archiveteam-bs |
|
15:01
🔗
|
|
primus104 has joined #archiveteam-bs |
|
15:02
🔗
|
|
SN4T14 has joined #archiveteam-bs |
|
15:09
🔗
|
|
JesseW has quit IRC (Leaving.) |
|
15:10
🔗
|
|
JesseW has joined #archiveteam-bs |
|
15:19
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
|
15:46
🔗
|
|
garyrh has quit IRC (Read error: Connection reset by peer) |
|
16:30
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
|
16:31
🔗
|
|
RichardG has joined #archiveteam-bs |
|
16:47
🔗
|
|
arkiver2 has joined #archiveteam-bs |
|
17:06
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
|
17:06
🔗
|
|
RichardG has joined #archiveteam-bs |
|
17:11
🔗
|
|
garyrh has joined #archiveteam-bs |
|
17:38
🔗
|
godane |
i found something interesting |
|
17:38
🔗
|
godane |
turns out that journeyman.tv has download.php?id=$n urls |
|
17:38
🔗
|
godane |
the video numbers and the ids are completely different |
|
17:39
🔗
|
godane |
example: http://www.journeyman.tv/download.php?id=1 |
|
17:39
🔗
|
godane |
it goes to http://www.journeyman.co.uk/media/video/97.flv |
|
17:49
🔗
|
godane |
download id 3 goes to 258.flv: http://www.journeyman.co.uk/media/video/258.flv |
|
17:50
🔗
|
godane |
that's just to prove that they do work |
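A rough sketch of how the id-to-file mapping above could be tabulated. It assumes `download.php` answers with an HTTP redirect whose `Location` header carries the `.flv` URL. The log only says the link "goes to" the file, so that is a guess. The helper names `location_of` and `resolve_id` are made up for this example.

```shell
#!/bin/sh
# Print the Location header value from raw HTTP response headers on stdin.
# Carriage returns are stripped first because HTTP headers end in CRLF.
location_of() {
  tr -d '\r' | awk 'tolower($1) == "location:" { print $2; exit }'
}

# Resolve one download id to its target URL (needs network access).
# Assumption: the server replies to a HEAD request with a redirect.
resolve_id() {
  curl -sI "http://www.journeyman.tv/download.php?id=$1" | location_of
}

# Usage (ids 1 to 3 chosen for illustration only):
# for i in 1 2 3; do printf '%s\t%s\n' "$i" "$(resolve_id "$i")"; done
```

Keeping the id next to the resolved file name gives something to match against later, since the video numbers and the download ids differ.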
|
17:52
🔗
|
xmc |
neat |
|
17:55
🔗
|
godane |
metadata may be a problem with this though |
|
18:03
🔗
|
godane |
it may not get metadata, now that i look at it |
|
18:03
🔗
|
godane |
it will just be a journeyman-pictures-download-id-$i item |
|
18:26
🔗
|
godane |
you can also do this: curl -s http://www.journeyman.tv/9000/short-films/ | grep -A1 playerCont | sed 's|.*href="||g' | sed 's|">.*||g' | grep ^http |
|
18:26
🔗
|
godane |
using that id will get metadata |
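For readers following the one-liner above, here is the same pipeline wrapped in a function that reads HTML on stdin, with each stage commented. The function name `extract_player_links` is made up, and the page markup is only inferred from the pipeline itself (a line containing `playerCont` followed by a line with an `href`), so treat this as a sketch rather than a description of the real page.

```shell
#!/bin/sh
# Read a listing page's HTML on stdin and print absolute link targets found
# on a line mentioning playerCont or on the line right after it.
extract_player_links() {
  # 1. keep each playerCont line plus the one line that follows it
  # 2. drop everything up to and including href="
  # 3. drop everything from the closing "> onward
  # 4. keep only results that start with http; this also discards the
  #    playerCont lines themselves and grep's "--" group separators
  grep -A1 playerCont | sed 's|.*href="||' | sed 's|">.*||' | grep '^http'
}

# Usage, mirroring the command in the log (needs network access):
# curl -s http://www.journeyman.tv/9000/short-films/ | extract_player_links
```

Splitting the fetch from the extraction makes it possible to try the filter on a saved copy of a page before looping over many listing pages.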
|
18:27
🔗
|
|
aaaaaaaaa has joined #archiveteam-bs |
|
18:27
🔗
|
|
swebb sets mode: +o aaaaaaaaa |
|
18:51
🔗
|
|
SimpBrain has quit IRC (Leaving) |
|
18:53
🔗
|
arkiver2 |
godane: are you going to grab all those? |
|
18:53
🔗
|
|
primus104 has quit IRC (Leaving.) |
|
19:04
🔗
|
godane |
maybe |
|
19:04
🔗
|
godane |
i'm doing it using the download id |
|
19:04
🔗
|
godane |
metadata is going to be a problem for these items |
|
19:26
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
|
19:29
🔗
|
|
primus104 has joined #archiveteam-bs |
|
19:31
🔗
|
|
primus105 has joined #archiveteam-bs |
|
19:33
🔗
|
|
SimpBrain has joined #archiveteam-bs |
|
19:37
🔗
|
|
primus104 has quit IRC (Read error: Operation timed out) |
|
19:41
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
|
19:49
🔗
|
|
dashcloud has joined #archiveteam-bs |
|
20:10
🔗
|
|
aaaaaaaa_ has joined #archiveteam-bs |
|
20:10
🔗
|
|
aaaaaaaaa has quit IRC (Read error: Connection reset by peer) |
|
20:10
🔗
|
|
swebb sets mode: +o aaaaaaaa_ |
|
20:35
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
|
20:35
🔗
|
|
arkiver2 has joined #archiveteam-bs |
|
20:42
🔗
|
|
dashcloud has joined #archiveteam-bs |
|
20:57
🔗
|
|
aaaaaaaa_ is now known as aaaaaaaaa |
|
21:13
🔗
|
|
JesseW has joined #archiveteam-bs |
|
21:13
🔗
|
|
PurpleSym has quit IRC (Remote host closed the connection) |
|
21:21
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
|
22:10
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
|
22:11
🔗
|
arkiver |
godane: if you'd like I can see if I can get the metadata for you |
|
23:38
🔗
|
bentpins |
thingiverse ~~ rsync: mkstemp "/warrior/thingiverse/trill/.thingiverse-thing_7454-20150926-190955.warc.gz.QpGl8m" (in chfoo) failed: Permission denied (13) |