Time |
Nickname |
Message |
00:52
🔗
|
|
primus104 has quit IRC (Leaving.) |
00:57
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:04
🔗
|
|
dashcloud has joined #archiveteam-bs |
01:19
🔗
|
|
JesseW has joined #archiveteam-bs |
01:23
🔗
|
|
toad1 has joined #archiveteam-bs |
01:31
🔗
|
|
toad2 has quit IRC (Read error: Operation timed out) |
02:30
🔗
|
|
furrie has joined #archiveteam-bs |
02:30
🔗
|
furrie |
hi i installed newest grab-site today. what is the all_start_urls file all about? |
02:39
🔗
|
furrie |
assuming ivan can help because he helped last time |
02:49
🔗
|
aaaaaaaaa |
looks like it is a list of all the urls the grab starts from. |
02:52
🔗
|
furrie |
even totally irrelevant ones right |
02:52
🔗
|
furrie |
like i can add unprotected directories too |
02:52
🔗
|
furrie |
because that's why I want to use it for |
02:54
🔗
|
aaaaaaaaa |
I don't think you manually add urls to the all_start_urls. Best I can tell, that file is only written to, never read. |
02:55
🔗
|
furrie |
darn |
02:55
🔗
|
aaaaaaaaa |
if you want a list of urls you use a different file and the --input-file= argument |
02:56
🔗
|
furrie |
i didn't find that argument under --help |
02:57
🔗
|
aaaaaaaaa |
it is in the readme |
02:59
🔗
|
furrie |
Aha, thanks |
03:01
🔗
|
|
furrie has quit IRC (Quit: Page closed) |
03:43
🔗
|
|
JesseW1 has joined #archiveteam-bs |
03:45
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
03:52
🔗
|
|
zenguy_pc has quit IRC (Read error: Connection reset by peer) |
03:56
🔗
|
|
sep332 has joined #archiveteam-bs |
04:05
🔗
|
|
JesseW1 has quit IRC (Ping timeout: 601 seconds) |
04:09
🔗
|
|
zenguy_pc has joined #archiveteam-bs |
04:11
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
04:33
🔗
|
|
JesseW has joined #archiveteam-bs |
04:47
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
04:51
🔗
|
yipdw |
wow, my gitlab 7.14 -> 8.0.2 upgrade went very well |
04:51
🔗
|
yipdw |
who the hell is on gitlab's packaging team and why are there not more of them? |
04:51
🔗
|
yipdw |
this is unrealistically good |
05:02
🔗
|
|
JesseW has joined #archiveteam-bs |
06:13
🔗
|
|
vitzli has joined #archiveteam-bs |
06:38
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
06:41
🔗
|
|
PurpleSym has joined #archiveteam-bs |
06:47
🔗
|
JesseW |
https://archive.org/stream/creativecomputing-1982-04-a/Creative_Computing_v08_n04_1982_April?ui=embed#page/n92/mode/1up <- That's a ... striking name for a technical journal... |
06:47
🔗
|
JesseW |
Give yourself over to ..., and it will improve your spreadsheet program! |
06:48
🔗
|
JesseW |
The creators of VisiCalc regularly speak through ..., don't you want to listen? |
07:00
🔗
|
midas |
lies yipdw, stuff needs to break just to be sure the upgrade did something |
07:17
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
07:18
🔗
|
|
primus104 has joined #archiveteam-bs |
07:22
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
08:02
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
08:35
🔗
|
|
kniffy has joined #archiveteam-bs |
08:39
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
08:44
🔗
|
|
kniffy has joined #archiveteam-bs |
08:49
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
08:51
🔗
|
|
kniffy has joined #archiveteam-bs |
09:02
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
09:06
🔗
|
|
schbirid has joined #archiveteam-bs |
09:26
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
09:42
🔗
|
|
primus104 has quit IRC (Leaving.) |
09:52
🔗
|
|
kniffy has joined #archiveteam-bs |
09:56
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
10:24
🔗
|
|
kniffy has joined #archiveteam-bs |
11:36
🔗
|
|
primus104 has joined #archiveteam-bs |
11:58
🔗
|
|
brayden has quit IRC (Ping timeout: 606 seconds) |
12:06
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
12:22
🔗
|
godane |
SketchCow: i'm watching your derbycon talk |
12:23
🔗
|
godane |
SketchCow: btw there was some rare art work on AOL CDs by famous actors kids i think at one point |
12:23
🔗
|
godane |
i know thinks cause it was talked about on TechTV |
12:23
🔗
|
godane |
when the other guy wanted 1M aol cds |
12:24
🔗
|
|
primus104 has quit IRC (Leaving.) |
12:53
🔗
|
|
kniffy has joined #archiveteam-bs |
12:58
🔗
|
|
SimpBrain has joined #archiveteam-bs |
13:06
🔗
|
|
brayden has joined #archiveteam-bs |
13:06
🔗
|
|
swebb sets mode: +o brayden |
14:18
🔗
|
|
SN4T14 has quit IRC (Ping timeout: 306 seconds) |
14:42
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
14:49
🔗
|
|
dashcloud has joined #archiveteam-bs |
14:58
🔗
|
|
JesseW has joined #archiveteam-bs |
15:01
🔗
|
|
primus104 has joined #archiveteam-bs |
15:02
🔗
|
|
SN4T14 has joined #archiveteam-bs |
15:09
🔗
|
|
JesseW has quit IRC (Leaving.) |
15:10
🔗
|
|
JesseW has joined #archiveteam-bs |
15:19
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
15:46
🔗
|
|
garyrh has quit IRC (Read error: Connection reset by peer) |
16:30
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
16:31
🔗
|
|
RichardG has joined #archiveteam-bs |
16:47
🔗
|
|
arkiver2 has joined #archiveteam-bs |
17:06
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
17:06
🔗
|
|
RichardG has joined #archiveteam-bs |
17:11
🔗
|
|
garyrh has joined #archiveteam-bs |
17:38
🔗
|
godane |
i found something interesting |
17:38
🔗
|
godane |
turns out that juurneyman.tv has download.php?id=$n urls |
17:38
🔗
|
godane |
the video numbers and the ids are completely different |
17:39
🔗
|
godane |
example: http://www.journeyman.tv/download.php?id=1 |
17:39
🔗
|
godane |
it goes to http://www.journeyman.co.uk/media/video/97.flv |
17:49
🔗
|
godane |
download id 3 got to 258.flv: http://www.journeyman.co.uk/media/video/258.flv |
17:50
🔗
|
godane |
that just to prove that they do work |
17:52
🔗
|
xmc |
neat |
17:55
🔗
|
godane |
metadata maybe a problem with this though |
18:03
🔗
|
godane |
it may not get metadata now looking at it |
18:03
🔗
|
godane |
it will just be a journeyman-pictures-download-id-$i item |
18:26
🔗
|
godane |
you can also do this: curl -s http://www.journeyman.tv/9000/short-films/ | grep -A1 playerCont | sed 's|.*href="||g' | sed 's|">.*||g' | grep ^http |
18:26
🔗
|
godane |
using that id will get metadata |
18:27
🔗
|
|
aaaaaaaaa has joined #archiveteam-bs |
18:27
🔗
|
|
swebb sets mode: +o aaaaaaaaa |
18:51
🔗
|
|
SimpBrain has quit IRC (Leaving) |
18:53
🔗
|
arkiver2 |
godane: are you going to grab all those? |
18:53
🔗
|
|
primus104 has quit IRC (Leaving.) |
19:04
🔗
|
godane |
maybe |
19:04
🔗
|
godane |
i'm doing it using the download id |
19:04
🔗
|
godane |
metadata is going to be a problem for these items |
19:26
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
19:29
🔗
|
|
primus104 has joined #archiveteam-bs |
19:31
🔗
|
|
primus105 has joined #archiveteam-bs |
19:33
🔗
|
|
SimpBrain has joined #archiveteam-bs |
19:37
🔗
|
|
primus104 has quit IRC (Read error: Operation timed out) |
19:41
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:49
🔗
|
|
dashcloud has joined #archiveteam-bs |
20:10
🔗
|
|
aaaaaaaa_ has joined #archiveteam-bs |
20:10
🔗
|
|
aaaaaaaaa has quit IRC (Read error: Connection reset by peer) |
20:10
🔗
|
|
swebb sets mode: +o aaaaaaaa_ |
20:35
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
20:35
🔗
|
|
arkiver2 has joined #archiveteam-bs |
20:42
🔗
|
|
dashcloud has joined #archiveteam-bs |
20:57
🔗
|
|
aaaaaaaa_ is now known as aaaaaaaaa |
21:13
🔗
|
|
JesseW has joined #archiveteam-bs |
21:13
🔗
|
|
PurpleSym has quit IRC (Remote host closed the connection) |
21:21
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
22:10
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
22:11
🔗
|
arkiver |
godane: if you'd like I can see if I can get the metadata for you |
23:38
🔗
|
bentpins |
thingiverse ~~ rsync: mkstemp "/warrior/thingiverse/trill/.thingiverse-thing_7454-20150926-190955.warc.gz.QpGl8m" (in chfoo) failed: Permission denied (13) |