[00:06] *** BlueMaxim has joined #archiveteam
[00:15] *** JesseW has quit IRC (Quit: Leaving.)
[00:39] *** tomwsmf-a has quit IRC (Ping timeout: 258 seconds)
[00:48] *** JesseW has joined #archiveteam
[01:12] *** primus104 has quit IRC (Leaving.)
[01:41] *** JesseW has quit IRC (Quit: Leaving.)
[01:43] *** Spirit has quit IRC (Read error: Operation timed out)
[01:48] *** Froggypwn has quit IRC (Ping timeout: 306 seconds)
[01:48] *** Froggypwn has joined #archiveteam
[01:56] *** Spirit has joined #archiveteam
[01:57] *** JesseW has joined #archiveteam
[02:28] *** JesseW has quit IRC (Quit: Leaving.)
[02:30] *** taumons has joined #archiveteam
[02:31] *** szalwia has quit IRC (Read error: Connection reset by peer)
[02:31] hi all, having trouble starting a warrior for the first time. Here's the error I'm getting: http://i.imgur.com/xb7EMKY.png
[02:32] once it starts (without downloading the newest code), I can access the web interface, but it can't see any projects for me to choose.
[02:32] I've deleted and remade the VM in virtualbox once, but no luck there. thoughts?
[02:35] *** szalwia has joined #archiveteam
[02:36] *** yuvadm has quit IRC (Read error: Operation timed out)
[02:36] *** tephra has quit IRC (Read error: Operation timed out)
[02:37] *** JesseW has joined #archiveteam
[02:40] *** tephra has joined #archiveteam
[02:46] taumons: can the host resolve github.com and did you change any network settings?
[02:48] host can resolve it fine, did not change any net settings
[02:48] I found this on the Archiveteam wiki, but the suggested steps did not help: http://askubuntu.com/questions/204953/virtualbox-dns-stopped-working-on-upgrade-to-12-10
[02:52] *** yuvadm has joined #archiveteam
[02:52] I went to a VT in the VM and can ping github just fine....
[02:52] and git works fine with GH
[02:58] *** oldcad has quit IRC (Quit: Leaving.)
[03:20] *** mistym has quit IRC (Remote host closed the connection)
[03:25] https://publish.comcast.net/splash/ Effective October 8, 2015, the Personal Web Page service (feature of XFINITY Internet) will no longer be available.
[03:27] nooooooooooooo my webpages
[03:27] actually this is not good
[03:34] KNEW IT
[03:34] FUCKING KNEW IT
[03:34] *** taumons has quit IRC (Quit: Page closed)
[03:35] but hey at least comcast gave a warning
[03:35] unlike verizon, apparently
[03:35] thankfully the one comcast site i wanted saved i already saved for myself
[03:35] and if i get the original person's permission, i will reupload it onto neocities
[03:39] *** Froggypwn has quit IRC (Ping timeout: 483 seconds)
[03:50] time to update the wiki
[03:52] irc channel: #comclose
[04:04] *** mistym has joined #archiveteam
[04:13] *** Yiffiel_d has quit IRC (Ping timeout: 252 seconds)
[04:15] paste.archivingyoursh.it is giving a 503
[04:22] *** tomwsmf-a has joined #archiveteam
[04:24] *** JesseW has quit IRC (Quit: Leaving.)
[04:31] What is being discussed here
[04:31] I just re-read
[04:31] do not do crowdfunding for this.
[04:32] Just prioritize highest viewed blip.tvs if possible, and we go until we get yelled at.
[04:35] *** bsmith096 has joined #archiveteam
[04:36] So, here's the deal.
[04:36] I'm going to be flying to North Carolina tomorrow.
[04:36] I am stuck there for a week at least. (Taking care of family member)
[04:36] I will have LOTS of time.
[04:37] I'd like to talk about ALL things archiveteam then.
[04:37] Make sense?
[04:37] SketchCow: didn't blip.tv purge a bunch of old stuff a few years ago?
[04:39] Yes
[04:39] *** BlueMaxim has quit IRC (Read error: Connection reset by peer)
[04:43] *** kyan has quit IRC (Quit: This computer has gone to sleep)
[04:43] *** kyan has joined #archiveteam
[04:44] *** aaaaaaaaa has quit IRC (Leaving)
[04:45] *** BlueMaxim has joined #archiveteam
[04:52] *** Stilett0 has joined #archiveteam
[04:57] *** Stiletto has quit IRC (Ping timeout: 483 seconds)
[05:07] sweet
[05:07] i'd love to hear what you would have to say
[05:08] SketchCow: ping. please join #warrior, we're completing the tracker migration and need dns changed
[05:20] *** tomwsmf-a has quit IRC (Read error: Operation timed out)
[05:28] SketchCow: Ok, but why do you not want crowdfunding? It'd help pay IA for storage and we won't be yelled at for what we grab and upload.
[05:28] But we'll discuss all that tomorrow
[05:33] SketchCow: i'm uploading to fos now, i'm very nearly done, or at least almost caught up with the flood of stories at ffnet. one thing though, because of how i saved them. some of the folders and files have a period as their first character in the name, and are hidden, watch out for those when compressing.
[05:33] you probably knew that already, just a reminder. thanks for the space :) much appreciated
[05:37] bsmith096: where on fos is this going
[05:38] yipdw: inside bsmith, its the "Fanfiction" folder
[05:38] just started uploading a few minutes ago
[05:38] ok, I don't think you're on fos
[05:38] maybe sis
[05:38] fos.textfiles.com
[05:39] I'm shelled in
[05:39] is that another thing?
[05:39] I'm trying to make sure that this shit handles your crazy filenames
[05:39] they upload just fine, the names get scrubbed with underscores by my download script, i think
[05:40] I'm looking at a different phase
[05:40] for example, "Pokemon"
[05:40] with the accented e becomes "Pok_mon"
[05:40] anyway, is "bsmith" the rsync module name or something else
[05:40] thats the folder where they are going, inside that is a folder Fanfiction
[05:40] are you accessing this via rsync, or what transport
[05:41] rsync
[05:41] what is the rsync path
[05:42] "bsmith/"
[05:42] what is the full rsync path
[05:43] I'm asking this because there is no bsmith rsync module and I am mighty confused as to what you are pointing to
[05:44] ummm, ok, i think i get it.
[05:44] here wacko@fos.textfiles.com/bsmith
[05:44] ok, I found it
[05:45] so, the Fanfiction folder is where its all going, inside that is a frankly, unreasonably huge number of categories
[05:45] I noticed
[05:47] they'll probably be ok
[05:47] FYI you don't need to corrupt filenames by eliminating acute-accent-es and the like
[05:47] UNIX filesystems can deal with those
[05:47] some of them will be hidden, b/c they happen to start with a dot
[05:48] that's fine
[05:48] they still exist :)
[05:48] only real problem is people like to use / characters
[05:48] i didn't, it was automatic, by the download script, each file has a chunk of metadata at the beginning, and thats fine
[05:48] * xmc nod
[05:56] *** JesseW has joined #archiveteam
[06:04] *** Start has quit IRC (Read error: Connection reset by peer)
[06:05] *** Start has joined #archiveteam
[06:13] *** bsmith096 has quit IRC (Ping timeout: 240 seconds)
[06:23] *** bsmith096 has joined #archiveteam
[06:30] *** RichardG has quit IRC (Remote host closed the connection)
[06:37] *** primus104 has joined #archiveteam
[06:49] *** JesseW has quit IRC (Quit: Leaving.)
[07:00] *** primus104 has quit IRC (Leaving.)
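The filename points raised in the exchange above (UNIX filesystems cope with accented characters, `/` is the real problem, and dot-prefixed names are hidden but still exist) can be sketched roughly as follows. This is only an illustration: `safe_component` and `hidden_entries` are hypothetical helper names, not part of the download script or of any Archive Team tooling mentioned in the log.

```python
import os


def safe_component(name, replacement="_"):
    """Make `name` usable as a single path component on a UNIX filesystem.

    Only "/" and the NUL byte are forbidden in a component, so accented
    characters such as the "é" in "Pokémon" are kept as they are.
    """
    cleaned = name.replace("/", replacement).replace("\0", replacement)
    # "." and ".." are reserved directory entries, and an empty name is invalid.
    if cleaned in ("", ".", ".."):
        cleaned = replacement + cleaned
    return cleaned


def hidden_entries(root):
    """Yield paths under `root` whose final component starts with a dot.

    Useful as a check before compressing, since shell globs such as `*`
    skip these entries by default.
    """
    for dirpath, dirnames, filenames in os.walk(root):
        for entry in dirnames + filenames:
            if entry.startswith("."):
                yield os.path.join(dirpath, entry)
```

Archiving the parent directory as a whole, rather than a `*` glob of its contents, is one way to make sure the dot-prefixed entries come along.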
[07:17] arkiver: you could always do it yourself and then dump the donation to IA
[07:24] *** primus104 has joined #archiveteam
[07:45] *** Medowar has joined #archiveteam
[07:48] *** atomotic has joined #archiveteam
[07:57] *** signius has quit IRC (Ping timeout: 306 seconds)
[08:09] *** signius has joined #archiveteam
[08:28] Indeed
[09:02] *** Spritecla has quit IRC (Quit: [Quit message changed because SOME-body is an asshole.])
[09:18] *** ohhdemgir has quit IRC (Read error: Operation timed out)
[09:32] *** garyrh has quit IRC (Read error: No route to host)
[10:04] *** dashcloud has quit IRC (Read error: Operation timed out)
[10:08] *** dashcloud has joined #archiveteam
[10:15] *** Ravenloft has quit IRC (Read error: Operation timed out)
[10:38] *** garyrh has joined #archiveteam
[10:43] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[11:32] *** primus104 has quit IRC (Leaving.)
[12:20] *** atomotic has joined #archiveteam
[12:25] *** edsu_ is now known as edsu
[12:28] *** RichardG has joined #archiveteam
[12:51] *** Froggypwn has joined #archiveteam
[13:10] *** Stilett0 has quit IRC (Read error: Connection reset by peer)
[13:12] *** Stiletto has joined #archiveteam
[13:18] *** tomwsmf-a has joined #archiveteam
[13:36] *** brayden has joined #archiveteam
[13:36] *** swebb sets mode: +o brayden
[13:47] *** useretail has quit IRC (...)
[13:50] *** xk_id has joined #archiveteam
[13:54] *** BlueMaxim has quit IRC (Read error: Connection reset by peer)
[14:02] *** Froggypwn has quit IRC (Read error: Connection reset by peer)
[14:03] *** Froggypwn has joined #archiveteam
[14:03] *** SadDM has quit IRC (Ping timeout: 483 seconds)
[14:04] *** xk_id has quit IRC (Remote host closed the connection)
[14:07] *** SadDM has joined #archiveteam
[14:07] *** swebb sets mode: +o SadDM
[14:11] *** lytv has quit IRC (Read error: Operation timed out)
[14:14] *** lytv has joined #archiveteam
[14:15] *** useretail has joined #archiveteam
[14:25] *** Froggypwn has quit IRC (Read error: Connection reset by peer)
[14:25] *** Froggypwn has joined #archiveteam
[14:26] *** Froggypwn has quit IRC (Read error: Connection reset by peer)
[14:28] fwiw AGDQ uploads their recordings to archive.org right from the source every year, no need to stream rip
[14:28] getting twitch comments would be interesting though
[14:28] *** Froggypwn has joined #archiveteam
[14:30] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[14:30] *** Froggypwn has quit IRC (Read error: Connection reset by peer)
[14:33] *** Froggypwn has joined #archiveteam
[14:47] *** mistym has quit IRC (Remote host closed the connection)
[14:58] *** mistym has joined #archiveteam
[15:05] *** primus104 has joined #archiveteam
[15:27] *** xk_id has joined #archiveteam
[15:38] *** phiren has quit IRC (Ping timeout: 506 seconds)
[15:38] *** phuzion has quit IRC (Read error: Operation timed out)
[15:44] *** phuzion has joined #archiveteam
[15:47] *** mistym has quit IRC (Remote host closed the connection)
[15:48] *** scyther has joined #archiveteam
[15:48] *** JesseW has joined #archiveteam
[16:01] *** xk_id has quit IRC (Remote host closed the connection)
[16:01] *** JesseW has quit IRC (Quit: Leaving.)
[16:02] *** xk_id has joined #archiveteam
[16:09] *** primus104 has quit IRC (Leaving.)
[16:14] *** vOYtEC has quit IRC (Read error: Operation timed out)
[16:20] *** xk_id has quit IRC (Remote host closed the connection)
[16:29] *** mistym has joined #archiveteam
[16:32] *** mistym_ has joined #archiveteam
[16:34] *** mistym has quit IRC (Read error: Operation timed out)
[16:35] chfoo: can you please add frontback to projects.json?
[16:36] and add a FOS rsync?
[16:37] Frontback is removing everything on the 15th of august
[16:42] *** mistym_ is now known as mistym
[16:45] *** Muad-Dib has quit IRC (Ping timeout: 252 seconds)
[16:47] *** Muad-Dib has joined #archiveteam
[16:50] *** xk_id has joined #archiveteam
[17:12] *** oli has left
[17:16] *** aaaaaaaaa has joined #archiveteam
[17:16] *** swebb sets mode: +o aaaaaaaaa
[17:29] *** Jonimus has quit IRC (Ping timeout: 483 seconds)
[17:30] *** tomwsmf-a has quit IRC (Read error: Operation timed out)
[17:36] *** aaaaaaaaa has quit IRC (Leaving)
[17:38] *** Jonimus has joined #archiveteam
[17:38] *** nmnn has joined #archiveteam
[17:49] *** oldcad has joined #archiveteam
[17:51] *** Jonimus has quit IRC (Ping timeout: 483 seconds)
[18:01] *** Jonimus has joined #archiveteam
[18:05] *** Spritecla has joined #archiveteam
[18:08] *** ohhdemgir has joined #archiveteam
[18:09] *** SketchCow has quit IRC (Read error: Operation timed out)
[18:11] *** primus104 has joined #archiveteam
[18:16] *** primus105 has joined #archiveteam
[18:18] *** primus104 has quit IRC (Read error: Operation timed out)
[18:26] *** SketchCow has joined #archiveteam
[18:26] *** swebb sets mode: +o SketchCow
[18:28] *** Stiletto has quit IRC ()
[18:45] arkiver: ok, added
[18:50] *** arkhive has joined #archiveteam
[18:54] *** scyther has quit IRC (Read error: Connection reset by peer)
[19:07] *** rejon has joined #archiveteam
[19:11] chfoo: thanks!!
[19:16] *** rejon has quit IRC (Ping timeout: 258 seconds)
[19:18] *** human39 has joined #archiveteam
[19:23] *** aaaaaaaaa has joined #archiveteam
[19:23] *** swebb sets mode: +o aaaaaaaaa
[19:31] *** mistym has quit IRC (Remote host closed the connection)
[19:37] *** dashcloud has quit IRC (Ping timeout: 252 seconds)
[19:41] *** dashcloud has joined #archiveteam
[20:22] *** nmnn has quit IRC (Ping timeout: 483 seconds)
[20:27] *** mistym has joined #archiveteam
[20:28] *** garyrh has quit IRC (Read error: Connection reset by peer)
[20:28] *** dashcloud has quit IRC (Read error: Operation timed out)
[20:30] *** dashcloud has joined #archiveteam
[20:46] We're almost starting the Frontback grab #frontbash
[20:49] arkiver: does it require the tracker? if yes, better to wait on kicking it off until tomorrow or so
[20:49] xmc: ok, please leave me a message as soon as the tracker can be used again
[20:49] sure thing
[20:56] *** Balrog_ has joined #archiveteam
[20:58] hey, are there any Warrior devs present?
[20:58] *** nertzy has quit IRC (Remote host closed the connection)
[21:06] *** arkiver has left
[21:06] *** arkiver has joined #archiveteam
[21:13] *** signius has quit IRC (Read error: Operation timed out)
[21:14] *** vOYtEC has joined #archiveteam
[21:15] what's the accepted way to have a seesaw script scrape HTML from a page and dump it as raw data
[21:17] I would think it would be as an ExternalProcess
[21:18] well I would like to format it nicely first
[21:18] wait, when you mean "dump it as raw data" do you mean output a bunch of html files?
[21:18] no
[21:19] raw data isn't right actually
[21:19] I want to take info from a page and transform it into a nicely structured text file for later parsing
[21:20] so I could say "put the info in these spans into these rows of this CSV"
[21:27] and now I'm getting pinged all over ><
[21:27] (balrog_ is not me...)
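The "put the info in these spans into these rows of this CSV" idea from the 21:15 to 21:20 exchange could look roughly like the sketch below. It uses only the Python standard library and assumes the wanted values sit in non-nested `<span class="...">` elements; the class names passed in are hypothetical, and nothing here is seesaw's own API. A script like this could be run on its own, for instance via the ExternalProcess route suggested in the log.

```python
import csv
import io
from html.parser import HTMLParser


class SpanCollector(HTMLParser):
    """Collect the text of <span> elements whose class is in `wanted_classes`.

    A row is emitted each time every wanted class has been seen once.
    Nested spans are not handled; this is a sketch, not a general scraper.
    """

    def __init__(self, wanted_classes):
        super().__init__(convert_charrefs=True)
        self.wanted = list(wanted_classes)
        self.rows = []
        self._current = None  # class of the span currently being read
        self._text = []
        self._row = {}

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            cls = dict(attrs).get("class")
            if cls in self.wanted:
                self._current = cls
                self._text = []

    def handle_data(self, data):
        if self._current is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._current is not None:
            self._row[self._current] = "".join(self._text).strip()
            self._current = None
            if len(self._row) == len(self.wanted):
                self.rows.append(self._row)
                self._row = {}


def spans_to_csv(html, wanted_classes):
    """Return CSV text with one column per wanted span class."""
    collector = SpanCollector(wanted_classes)
    collector.feed(html)
    collector.close()
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(wanted_classes), lineterminator="\n")
    writer.writeheader()
    writer.writerows(collector.rows)
    return out.getvalue()
```

Letting the `csv` module do the quoting means values that contain commas or quotes stay parseable later.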
[21:27] *** signius has joined #archiveteam
[21:27] whuh
[21:28] what client are you using
[21:28] it's just going to cause lots of confusion :)
[21:29] wait are you two different people
[21:29] yes
[21:29] o
[21:29] one of you needs to change your name
[21:29] I use balrog for everything and sometimes it fucks up
[21:29] *** Balrog_ is now known as Balrog-wa
[21:30] ok
[21:30] still confusing, but better
[21:30] well I've been using it on IRC for 8 years and don't intend to change it
[21:30] also balrog you have an @-sign
[21:31] yeah that's better
[21:31] that just means ops
[21:31] and at sign?
[21:31] oh
[21:31] xmc: _ means confusion with PMs and PMs potentially going to the wrong person
[21:31] anyway thanks, and sorry for the noise
[21:31] yeah I'm just a random dummy trying to save things nobody else seems to care about
[21:31] yeah, also hard to distinguish in the channel
[21:40] *** Spritecla has quit IRC (Ping timeout: 306 seconds)
[21:43] *** dashcloud has quit IRC (Read error: Operation timed out)
[21:44] *** mistym_ has joined #archiveteam
[21:45] *** Spritecla has joined #archiveteam
[21:48] *** dashcloud has joined #archiveteam
[21:50] *** mistym has quit IRC (Read error: Operation timed out)
[21:50] *** underscor has quit IRC (Read error: Connection reset by peer)
[21:59] *** xtr-201 has quit IRC (Read error: Operation timed out)
[22:05] *** arkhive has quit IRC (Read error: Operation timed out)
[22:12] *** mistym_ has quit IRC (Remote host closed the connection)
[22:14] *** Yiffiel_d has joined #archiveteam
[22:15] Heeeey got a project, but I don't know if they will allow you guys direct access or not.
[22:16] http://america.aljazeera.com/watch/shows/america-tonight/articles/2014/12/10/debate-gamergate.html went to this article, wanted to get just the text of the debate
[22:16] http://branch.com/?ref=embed aaaaand it turns out they're going to die
[22:19] https://archive.is/M5Ytv as ya can see, it's not a full save.
[22:19] *** mistym has joined #archiveteam
[22:46] *** underscor has joined #archiveteam
[23:00] *** Muad-Dib has quit IRC (Quit: ZNC - http://znc.in)
[23:18] *** Stiletto has joined #archiveteam
[23:24] >sign into twitter
[23:24] dayum
[23:24] that could be problematic
[23:26] *** dashcloud has quit IRC (Read error: Operation timed out)
[23:29] *** dashcloud has joined #archiveteam
[23:38] *** arkhive has joined #archiveteam
[23:38] *** arkhive has quit IRC (Client Quit)
[23:47] *** garyrh has joined #archiveteam
[23:57] Yiffiel_d: we have a copy running at #archivebot right now; in a day or so you can download the WARC and extract the text from the response records
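For the last suggestion, downloading the WARC and extracting the text from the response records, a minimal sketch might look like this. It assumes the third-party `warcio` package is installed and that the pages of interest are HTML; `iter_html_responses`, `html_to_text` and the file name in the usage note are hypothetical, and the tag stripping is deliberately crude.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def html_to_text(payload, encoding="utf-8"):
    """Turn an HTML payload (bytes) into newline-separated text chunks."""
    parser = _TextExtractor()
    parser.feed(payload.decode(encoding, errors="replace"))
    parser.close()
    return "\n".join(parser.chunks)


def iter_html_responses(warc_path):
    """Yield (target URI, payload bytes) for each HTML response record."""
    # Imported lazily so html_to_text stays usable without warcio installed.
    from warcio.archiveiterator import ArchiveIterator

    with open(warc_path, "rb") as stream:
        for record in ArchiveIterator(stream):
            if record.rec_type != "response":
                continue
            content_type = ""
            if record.http_headers is not None:
                content_type = record.http_headers.get_header("Content-Type") or ""
            if "html" not in content_type.lower():
                continue
            uri = record.rec_headers.get_header("WARC-Target-URI")
            yield uri, record.content_stream().read()
```

Usage would be along the lines of `for uri, body in iter_html_responses("job.warc.gz"): print(uri, html_to_text(body))`, with the real file name taken from whatever the ArchiveBot job produces.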