[00:52] *** primus104 has quit IRC (Leaving.) [00:57] *** dashcloud has quit IRC (Read error: Operation timed out) [01:04] *** dashcloud has joined #archiveteam-bs [01:19] *** JesseW has joined #archiveteam-bs [01:23] *** toad1 has joined #archiveteam-bs [01:31] *** toad2 has quit IRC (Read error: Operation timed out) [02:30] *** furrie has joined #archiveteam-bs [02:30] hi i installed newest grab-site today. what is the all_start_urls file all about? [02:39] assuming ivan can help because he helped last time [02:49] looks like it is a list of all the urls the grab starts from. [02:52] even totally irrelevant ones right [02:52] like i can add unprotected directories too [02:52] because that's why I want to use it for [02:54] I don't think you manually add urls to the all_start_urls. Best I can tell, that file is only written to, never read. [02:55] darn [02:55] if you want a list of urls you use a different file and the --input-file= argument [02:56] i didn't find that argument under --help [02:57] it is in the readme [02:59] Aha, thanks [03:01] *** furrie has quit IRC (Quit: Page closed) [03:43] *** JesseW1 has joined #archiveteam-bs [03:45] *** JesseW has quit IRC (Read error: Operation timed out) [03:52] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [03:56] *** sep332 has joined #archiveteam-bs [04:05] *** JesseW1 has quit IRC (Ping timeout: 601 seconds) [04:09] *** zenguy_pc has joined #archiveteam-bs [04:11] *** aaaaaaaaa has quit IRC (Leaving) [04:33] *** JesseW has joined #archiveteam-bs [04:47] *** JesseW has quit IRC (Read error: Operation timed out) [04:51] wow, my gitlab 7.14 -> 8.0.2 upgrade went very well [04:51] who the hell is on gitlab's packaging team and why are there not more of them? [04:51] this is unrealistically good [05:02] *** JesseW has joined #archiveteam-bs [06:13] *** vitzli has joined #archiveteam-bs [06:38] *** wyatt8740 has joined #archiveteam-bs [06:41] *** PurpleSym has joined #archiveteam-bs [06:47] https://archive.org/stream/creativecomputing-1982-04-a/Creative_Computing_v08_n04_1982_April?ui=embed#page/n92/mode/1up <- That's a ... striking name for a technical journal... [06:47] Give yourself over to ..., and it will improve your spreadsheet program! [06:48] The creators of VisiCalc regularly speak through ..., don't you want to listen? [07:00] lies yipdw, stuff needs to break just to be sure the upgrade did something [07:17] *** JesseW has quit IRC (Read error: Operation timed out) [07:18] *** primus104 has joined #archiveteam-bs [07:22] *** vitzli has quit IRC (Quit: Leaving) [08:02] *** kniffy has quit IRC (Ping timeout: 240 seconds) [08:35] *** kniffy has joined #archiveteam-bs [08:39] *** kniffy has quit IRC (Ping timeout: 240 seconds) [08:44] *** kniffy has joined #archiveteam-bs [08:49] *** kniffy has quit IRC (Ping timeout: 240 seconds) [08:51] *** kniffy has joined #archiveteam-bs [09:02] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [09:06] *** schbirid has joined #archiveteam-bs [09:26] *** kniffy has quit IRC (Ping timeout: 240 seconds) [09:42] *** primus104 has quit IRC (Leaving.) [09:52] *** kniffy has joined #archiveteam-bs [09:56] *** kniffy has quit IRC (Ping timeout: 240 seconds) [10:24] *** kniffy has joined #archiveteam-bs [11:36] *** primus104 has joined #archiveteam-bs [11:58] *** brayden has quit IRC (Ping timeout: 606 seconds) [12:06] *** kniffy has quit IRC (Ping timeout: 240 seconds) [12:22] SketchCow: i'm watching your derbycon talk [12:23] SketchCow: btw there was some rare art work on AOL CDs by famous actors kids i think at one point [12:23] i know thinks cause it was talked about on TechTV [12:23] when the other guy wanted 1M aol cds [12:24] *** primus104 has quit IRC (Leaving.) [12:53] *** kniffy has joined #archiveteam-bs [12:58] *** SimpBrain has joined #archiveteam-bs [13:06] *** brayden has joined #archiveteam-bs [13:06] *** swebb sets mode: +o brayden [14:18] *** SN4T14 has quit IRC (Ping timeout: 306 seconds) [14:42] *** dashcloud has quit IRC (Read error: Operation timed out) [14:49] *** dashcloud has joined #archiveteam-bs [14:58] *** JesseW has joined #archiveteam-bs [15:01] *** primus104 has joined #archiveteam-bs [15:02] *** SN4T14 has joined #archiveteam-bs [15:09] *** JesseW has quit IRC (Leaving.) [15:10] *** JesseW has joined #archiveteam-bs [15:19] *** JesseW has quit IRC (Read error: Operation timed out) [15:46] *** garyrh has quit IRC (Read error: Connection reset by peer) [16:30] *** RichardG has quit IRC (Read error: Connection reset by peer) [16:31] *** RichardG has joined #archiveteam-bs [16:47] *** arkiver2 has joined #archiveteam-bs [17:06] *** RichardG has quit IRC (Read error: Connection reset by peer) [17:06] *** RichardG has joined #archiveteam-bs [17:11] *** garyrh has joined #archiveteam-bs [17:38] i found something interesting [17:38] turns out that juurneyman.tv has download.php?id=$n urls [17:38] the video numbers and the ids are completely different [17:39] example: http://www.journeyman.tv/download.php?id=1 [17:39] it goes to http://www.journeyman.co.uk/media/video/97.flv [17:49] download id 3 got to 258.flv: http://www.journeyman.co.uk/media/video/258.flv [17:50] that just to prove that they do work [17:52] neat [17:55] metadata maybe a problem with this though [18:03] it may not get metadata now looking at it [18:03] it will just be a journeyman-pictures-download-id-$i item [18:26] you can also do this: curl -s http://www.journeyman.tv/9000/short-films/ | grep -A1 playerCont | sed 's|.*href="||g' | sed 's|">.*||g' | grep ^http [18:26] using that id will get metadata [18:27] *** aaaaaaaaa has joined #archiveteam-bs [18:27] *** swebb sets mode: +o aaaaaaaaa [18:51] *** SimpBrain has quit IRC (Leaving) [18:53] godane: are you going to grab all those? [18:53] *** primus104 has quit IRC (Leaving.) [19:04] maybe [19:04] i'm doing it using the download id [19:04] metadata is going to be a problem for these items [19:26] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [19:29] *** primus104 has joined #archiveteam-bs [19:31] *** primus105 has joined #archiveteam-bs [19:33] *** SimpBrain has joined #archiveteam-bs [19:37] *** primus104 has quit IRC (Read error: Operation timed out) [19:41] *** dashcloud has quit IRC (Read error: Operation timed out) [19:49] *** dashcloud has joined #archiveteam-bs [20:10] *** aaaaaaaa_ has joined #archiveteam-bs [20:10] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer) [20:10] *** swebb sets mode: +o aaaaaaaa_ [20:35] *** dashcloud has quit IRC (Read error: Operation timed out) [20:35] *** arkiver2 has joined #archiveteam-bs [20:42] *** dashcloud has joined #archiveteam-bs [20:57] *** aaaaaaaa_ is now known as aaaaaaaaa [21:13] *** JesseW has joined #archiveteam-bs [21:13] *** PurpleSym has quit IRC (Remote host closed the connection) [21:21] *** JesseW has quit IRC (Read error: Operation timed out) [22:10] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [22:11] godane: if you'd like I can see if I can get the metadata for you [23:38] thingiverse ~~ rsync: mkstemp "/warrior/thingiverse/trill/.thingiverse-thing_7454-20150926-190955.warc.gz.QpGl8m" (in chfoo) failed: Permission denied (13)