#archiveteam-bs 2015-09-26,Sat

↑back Search

Time Nickname Message
00:52 🔗 primus104 has quit IRC (Leaving.)
00:57 🔗 dashcloud has quit IRC (Read error: Operation timed out)
01:04 🔗 dashcloud has joined #archiveteam-bs
01:19 🔗 JesseW has joined #archiveteam-bs
01:23 🔗 toad1 has joined #archiveteam-bs
01:31 🔗 toad2 has quit IRC (Read error: Operation timed out)
02:30 🔗 furrie has joined #archiveteam-bs
02:30 🔗 furrie hi i installed newest grab-site today. what is the all_start_urls file all about?
02:39 🔗 furrie assuming ivan can help because he helped last time
02:49 🔗 aaaaaaaaa looks like it is a list of all the urls the grab starts from.
02:52 🔗 furrie even totally irrelevant ones right
02:52 🔗 furrie like i can add unprotected directories too
02:52 🔗 furrie because that's why I want to use it for
02:54 🔗 aaaaaaaaa I don't think you manually add urls to the all_start_urls. Best I can tell, that file is only written to, never read.
02:55 🔗 furrie darn
02:55 🔗 aaaaaaaaa if you want a list of urls you use a different file and the --input-file= argument
02:56 🔗 furrie i didn't find that argument under --help
02:57 🔗 aaaaaaaaa it is in the readme
02:59 🔗 furrie Aha, thanks
03:01 🔗 furrie has quit IRC (Quit: Page closed)
03:43 🔗 JesseW1 has joined #archiveteam-bs
03:45 🔗 JesseW has quit IRC (Read error: Operation timed out)
03:52 🔗 zenguy_pc has quit IRC (Read error: Connection reset by peer)
03:56 🔗 sep332 has joined #archiveteam-bs
04:05 🔗 JesseW1 has quit IRC (Ping timeout: 601 seconds)
04:09 🔗 zenguy_pc has joined #archiveteam-bs
04:11 🔗 aaaaaaaaa has quit IRC (Leaving)
04:33 🔗 JesseW has joined #archiveteam-bs
04:47 🔗 JesseW has quit IRC (Read error: Operation timed out)
04:51 🔗 yipdw wow, my gitlab 7.14 -> 8.0.2 upgrade went very well
04:51 🔗 yipdw who the hell is on gitlab's packaging team and why are there not more of them?
04:51 🔗 yipdw this is unrealistically good
05:02 🔗 JesseW has joined #archiveteam-bs
06:13 🔗 vitzli has joined #archiveteam-bs
06:38 🔗 wyatt8740 has joined #archiveteam-bs
06:41 🔗 PurpleSym has joined #archiveteam-bs
06:47 🔗 JesseW https://archive.org/stream/creativecomputing-1982-04-a/Creative_Computing_v08_n04_1982_April?ui=embed#page/n92/mode/1up <- That's a ... striking name for a technical journal...
06:47 🔗 JesseW Give yourself over to ..., and it will improve your spreadsheet program!
06:48 🔗 JesseW The creators of VisiCalc regularly speak through ..., don't you want to listen?
07:00 🔗 midas lies yipdw, stuff needs to break just to be sure the upgrade did something
07:17 🔗 JesseW has quit IRC (Read error: Operation timed out)
07:18 🔗 primus104 has joined #archiveteam-bs
07:22 🔗 vitzli has quit IRC (Quit: Leaving)
08:02 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
08:35 🔗 kniffy has joined #archiveteam-bs
08:39 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
08:44 🔗 kniffy has joined #archiveteam-bs
08:49 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
08:51 🔗 kniffy has joined #archiveteam-bs
09:02 🔗 BlueMaxim has quit IRC (Read error: Connection reset by peer)
09:06 🔗 schbirid has joined #archiveteam-bs
09:26 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
09:42 🔗 primus104 has quit IRC (Leaving.)
09:52 🔗 kniffy has joined #archiveteam-bs
09:56 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
10:24 🔗 kniffy has joined #archiveteam-bs
11:36 🔗 primus104 has joined #archiveteam-bs
11:58 🔗 brayden has quit IRC (Ping timeout: 606 seconds)
12:06 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
12:22 🔗 godane SketchCow: i'm watching your derbycon talk
12:23 🔗 godane SketchCow: btw there was some rare art work on AOL CDs by famous actors kids i think at one point
12:23 🔗 godane i know thinks cause it was talked about on TechTV
12:23 🔗 godane when the other guy wanted 1M aol cds
12:24 🔗 primus104 has quit IRC (Leaving.)
12:53 🔗 kniffy has joined #archiveteam-bs
12:58 🔗 SimpBrain has joined #archiveteam-bs
13:06 🔗 brayden has joined #archiveteam-bs
13:06 🔗 swebb sets mode: +o brayden
14:18 🔗 SN4T14 has quit IRC (Ping timeout: 306 seconds)
14:42 🔗 dashcloud has quit IRC (Read error: Operation timed out)
14:49 🔗 dashcloud has joined #archiveteam-bs
14:58 🔗 JesseW has joined #archiveteam-bs
15:01 🔗 primus104 has joined #archiveteam-bs
15:02 🔗 SN4T14 has joined #archiveteam-bs
15:09 🔗 JesseW has quit IRC (Leaving.)
15:10 🔗 JesseW has joined #archiveteam-bs
15:19 🔗 JesseW has quit IRC (Read error: Operation timed out)
15:46 🔗 garyrh has quit IRC (Read error: Connection reset by peer)
16:30 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
16:31 🔗 RichardG has joined #archiveteam-bs
16:47 🔗 arkiver2 has joined #archiveteam-bs
17:06 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
17:06 🔗 RichardG has joined #archiveteam-bs
17:11 🔗 garyrh has joined #archiveteam-bs
17:38 🔗 godane i found something interesting
17:38 🔗 godane turns out that juurneyman.tv has download.php?id=$n urls
17:38 🔗 godane the video numbers and the ids are completely different
17:39 🔗 godane example: http://www.journeyman.tv/download.php?id=1
17:39 🔗 godane it goes to http://www.journeyman.co.uk/media/video/97.flv
17:49 🔗 godane download id 3 got to 258.flv: http://www.journeyman.co.uk/media/video/258.flv
17:50 🔗 godane that just to prove that they do work
17:52 🔗 xmc neat
17:55 🔗 godane metadata maybe a problem with this though
18:03 🔗 godane it may not get metadata now looking at it
18:03 🔗 godane it will just be a journeyman-pictures-download-id-$i item
18:26 🔗 godane you can also do this: curl -s http://www.journeyman.tv/9000/short-films/ | grep -A1 playerCont | sed 's|.*href="||g' | sed 's|">.*||g' | grep ^http
18:26 🔗 godane using that id will get metadata
18:27 🔗 aaaaaaaaa has joined #archiveteam-bs
18:27 🔗 swebb sets mode: +o aaaaaaaaa
18:51 🔗 SimpBrain has quit IRC (Leaving)
18:53 🔗 arkiver2 godane: are you going to grab all those?
18:53 🔗 primus104 has quit IRC (Leaving.)
19:04 🔗 godane maybe
19:04 🔗 godane i'm doing it using the download id
19:04 🔗 godane metadata is going to be a problem for these items
19:26 🔗 arkiver2 has quit IRC (Ping timeout: 252 seconds)
19:29 🔗 primus104 has joined #archiveteam-bs
19:31 🔗 primus105 has joined #archiveteam-bs
19:33 🔗 SimpBrain has joined #archiveteam-bs
19:37 🔗 primus104 has quit IRC (Read error: Operation timed out)
19:41 🔗 dashcloud has quit IRC (Read error: Operation timed out)
19:49 🔗 dashcloud has joined #archiveteam-bs
20:10 🔗 aaaaaaaa_ has joined #archiveteam-bs
20:10 🔗 aaaaaaaaa has quit IRC (Read error: Connection reset by peer)
20:10 🔗 swebb sets mode: +o aaaaaaaa_
20:35 🔗 dashcloud has quit IRC (Read error: Operation timed out)
20:35 🔗 arkiver2 has joined #archiveteam-bs
20:42 🔗 dashcloud has joined #archiveteam-bs
20:57 🔗 aaaaaaaa_ is now known as aaaaaaaaa
21:13 🔗 JesseW has joined #archiveteam-bs
21:13 🔗 PurpleSym has quit IRC (Remote host closed the connection)
21:21 🔗 JesseW has quit IRC (Read error: Operation timed out)
22:10 🔗 arkiver2 has quit IRC (Ping timeout: 252 seconds)
22:11 🔗 arkiver godane: if you'd like I can see if I can get the metadata for you
23:38 🔗 bentpins thingiverse ~~ rsync: mkstemp "/warrior/thingiverse/trill/.thingiverse-thing_7454-20150926-190955.warc.gz.QpGl8m" (in chfoo) failed: Permission denied (13)

irclogger-viewer