[00:06] *** brayden has quit IRC (Read error: Operation timed out)
[00:19] *** JesseW has joined #archiveteam-bs
[00:42] *** RichardG has quit IRC (Read error: Connection reset by peer)
[00:43] *** RichardG has joined #archiveteam-bs
[00:47] JesseW: i believe i've got things set up and working correctly, ran a test of 10 identifiers through last night, i'd like you to take a look at the output
[00:55] *** Jonimus has joined #archiveteam-bs
[00:55] *** swebb sets mode: +o Jonimus
[01:05] *** MrRadar has quit IRC (Quit: Rebooting)
[01:10] *** MrRadar has joined #archiveteam-bs
[01:27] *** _desu___ has joined #archiveteam-bs
[02:36] *** BlueMaxim has quit IRC (Read error: Operation timed out)
[03:00] *** bwn has quit IRC (Ping timeout: 492 seconds)
[03:13] *** Yoshimura has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client)
[03:18] SketchCow: looks like archive.org is having timeout issues
[03:18] good news is i think i got all of 2015 kpfa mp3s uploaded
[03:18] *** Yoshimura has joined #archiveteam-bs
[03:19] now i can take my slow ass time with the rest of kpfa if i want to
[03:32] *** brayden has joined #archiveteam-bs
[03:32] *** swebb sets mode: +o brayden
[03:40] *** Yoshimura has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client)
[03:41] *** Yoshimura has joined #archiveteam-bs
[04:27] *** bwn_ has joined #archiveteam-bs
[04:33] *** bwn_ is now known as bwn
[04:35] *** xXx_ndidd has joined #archiveteam-bs
[04:36] *** Famicoman has quit IRC (Ping timeout: 260 seconds)
[04:48] *** ndiddy has quit IRC (Read error: Operation timed out)
[05:01] *** Sk1d has quit IRC (Ping timeout: 250 seconds)
[05:08] *** Sk1d has joined #archiveteam-bs
[05:21] *** Famicoman has joined #archiveteam-bs
[05:21] *** Honno has joined #archiveteam-bs
[05:36] *** metalcamp has joined #archiveteam-bs
[06:15] *** JesseW has quit IRC (Ping timeout: 370 seconds)
[06:23] *** RichardG has quit IRC (Read error: Operation timed out)
[07:05] *** RedType has joined #archiveteam-bs
[07:13] *** schbirid has joined #archiveteam-bs
[07:31] *** xXx_ndidd has quit IRC (Read error: Operation timed out)
[07:40] *** RichardG has joined #archiveteam-bs
[07:51] *** bwn has quit IRC (Read error: Operation timed out)
[08:16] *** Muad-Dib has quit IRC (Quit: ZNC - http://znc.in)
[08:19] *** bwn has joined #archiveteam-bs
[08:19] *** slyphic has quit IRC (Quit: Lost terminal)
[10:56] *** signius has quit IRC (Read error: Operation timed out)
[11:07] *** signius has joined #archiveteam-bs
[11:34] *** slpeeds has quit IRC (Remote host closed the connection)
[11:35] *** fdo54ss has joined #archiveteam-bs
[13:38] *** twrist has joined #archiveteam-bs
[13:38] *** GLaDOS has quit IRC (Ping timeout: 260 seconds)
[13:38] *** twrist is now known as GLaDOS
[14:19] *** Start has quit IRC (Quit: Disconnected.)
[14:54] *** Start has joined #archiveteam-bs
[15:09] Huh.
[16:06] *** Start has quit IRC (Quit: Disconnected.)
[16:07] Anyone know how to specify the port number when running scripts manually?
[16:08] use --port 1337
[16:09] For pipeline.py?
[16:26] run-pipeline, I did overlook that, damn.
[18:01] *** bwn has quit IRC (Ping timeout: 246 seconds)
[18:11] *** Medowar has joined #archiveteam-bs
[18:33] *** bwn has joined #archiveteam-bs
[18:41] *** BnA-Rob1n has quit IRC (Remote host closed the connection)
[19:19] *** BnA-Rob1n has joined #archiveteam-bs
[19:33] *** Start has joined #archiveteam-bs
[19:36] *** signius has quit IRC (Read error: Operation timed out)
[19:43] *** Start has quit IRC (Quit: Disconnected.)
[19:48] *** signius has joined #archiveteam-bs
[19:49] *** Muad-Dib has joined #archiveteam-bs
[20:06] *** signius has quit IRC (Read error: Operation timed out)
[20:12] *** signius has joined #archiveteam-bs
[20:16] http://fos.textfiles.com/ARCHIVETEAM/ is humming along nicely.
[20:18] Looks like a new thing gets shoved into the archive every hour and a half right now.
[20:24] *** schbirid has quit IRC (Quit: Leaving)
[20:40] *** Honno has quit IRC (Read error: Operation timed out)
[20:51] *** metalcamp has quit IRC (Ping timeout: 244 seconds)
[21:09] *** Sanqui has quit IRC (Quit: hop)
[21:10] *** Sanky is now known as Sanqui
[21:12] *** pwnsrv has joined #archiveteam-bs
[21:14] *** pwnsrv has quit IRC (Read error: Connection reset by peer)
[21:16] *** pwnsrv has joined #archiveteam-bs
[21:40] *** Stiletto has quit IRC (Read error: Operation timed out)
[21:56] *** pwnsrv has quit IRC (Ping timeout: 250 seconds)
[22:06] *** Start has joined #archiveteam-bs
[22:14] *** RedType has left
[22:35] *** ndiddy has joined #archiveteam-bs
[22:44] *** brayden has quit IRC (Read error: Connection reset by peer)
[22:45] *** BlueMaxim has joined #archiveteam-bs
[23:04] SketchCow: may be able to get more manuals from radioshack
[23:05] if nothing else maybe web archives of it
[23:05] my point is there are only 79 urls for this path: https://web.archive.org/web/*/support.radioshack.com/support_accessories/doc66/*
[23:06] it's possible that there are close to 2000 when you look for html and pdf
[23:10] *** Yoshimura has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client)
[23:18] godane: as you probably already know, the current root seems to be http://support.radioshack.com/productinfo
[23:18] might be worth just dumping that in archivebot
[23:21] JW_work : done :)
[23:23] JW_work: not sure how well archivebot will take it, as it's a silly form thing instead of using hyperlinks like sensible people, but... let's find out!
[23:23] * alfie does not know all too much about the inner workings of archivebot
[23:23] archivebot clicks links
[23:26] xmc: about as much as I expected, silly form things are incredibly silly. once you pick a category it starts giving actual links, i'll see how hard it'd be to enumerate all the categories
[23:26] oh they're numbered, that's easy
[23:27] nice
[23:28] *** Medowar has quit IRC (Quit: Connection closed for inactivity)
[23:43] it was already done: http://archive.fart.website/archivebot/viewer/domain/support.radioshack.com
[23:44] i'm doing my own sort of grab cause i think archivebot didn't do it right
[23:45] godane: i mean, it would be worthwhile throwing the list of links my script just spat out at archivebot anyway
[23:45] partly because i just made all that effort ;)
[23:45] does archivebot accept .txtss?
[23:45] -s
[23:48] my other reason for thinking archivebot didn't grab it right is this:
[23:48] https://web.archive.org/web/*/support.radioshack.com/support_audio/doc22/*
[23:49] only 11 urls have dates from 2013 and after
[23:50] so based on that archivebot didn't grab any of these files or paths
[23:50] the best of the worst case is that archivebot stuff for radioshack didn't get into wayback for some reason
[23:50] i'll pump my list of URLs in anyway, what's the worst that can happen?
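
[editor's note] The "only 11 urls have dates from 2013 and after" check above can be repeated without paging through the Wayback listing by hand. What follows is a minimal sketch, not what anyone in the channel actually ran. It assumes the Wayback Machine CDX server endpoint and its url, matchType, fl and from query parameters; the helper names build_cdx_query and count_urls_since are hypothetical, and the sample rows in the usage note are made up for illustration.

```python
from urllib.parse import urlencode

# Assumption: the public Wayback CDX server endpoint.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(prefix, from_year=None):
    """Build a CDX query URL listing captures under a URL prefix.

    Hypothetical helper: asks only for the timestamp and original URL
    fields, one capture per output line.
    """
    params = {"url": prefix, "matchType": "prefix", "fl": "timestamp,original"}
    if from_year is not None:
        params["from"] = str(from_year)
    return CDX_ENDPOINT + "?" + urlencode(params)


def count_urls_since(cdx_text, year):
    """Count distinct original URLs captured in `year` or later.

    Expects whitespace-separated lines of the form "timestamp original",
    where the timestamp starts with a four-digit year.
    """
    seen = set()
    for line in cdx_text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        timestamp, original = parts[0], parts[1]
        if int(timestamp[:4]) >= year:
            seen.add(original)
    return len(seen)
```

Usage, under the same assumptions: fetch build_cdx_query("support.radioshack.com/support_audio/doc22/") with any HTTP client, then pass the response body to count_urls_since(body, 2013). A low count compared with the number of files the site actually serves would support the suspicion that the archivebot job never made it into the Wayback Machine.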