[00:41] *** Start_ has joined #webroasting [00:41] *** Start has quit IRC (Read error: Connection reset by peer) [00:50] *** Start_ is now known as Start [00:50] *** svchfoo3 sets mode: +o Start [08:10] *** wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES) [08:24] *** wp494 has joined #webroasting [08:24] *** svchfoo3 sets mode: +o wp494 [15:41] *** Start has quit IRC (Quit: Disconnected.) [18:29] *** Start has joined #webroasting [19:19] *** Start has quit IRC (Quit: Disconnected.) [19:36] *** Start has joined #webroasting [19:59] *** Ctrl-S___ has quit IRC (Write error: Broken pipe) [19:59] *** _desu___ has quit IRC (Write error: Broken pipe) [20:10] *** Start_ has joined #webroasting [20:10] *** Start has quit IRC (Read error: Connection reset by peer) [20:11] *** chfoo has quit IRC (Read error: Operation timed out) [20:26] *** chfoo has joined #webroasting [20:40] *** _desu___ has joined #webroasting [20:45] *** Ctrl-S___ has joined #webroasting [20:45] *** Start_ has quit IRC (Quit: Disconnected.) [20:46] *** svchfoo3 sets mode: +o Ctrl-S___ [20:49] *** Start has joined #webroasting [22:16] *** arkiver has joined #webroasting [22:17] hi [22:19] isp web hosting is like the next gen after geocities [22:20] some sites like geocities styling, other are good :) [22:23] we should probably start with hosts that publicly offer lists of their sites like chebucto: http://www.chebucto.ns.ca/subject.shtml [22:39] for web hosts without public lists we'll just scrape google & other sites [22:40] then as we download the sites we can scrape their html to discover more [22:41] Or I can allow in the scripts to download other sites hosted by the same company that it finds while archiving the websites too [22:46] *** chfoo has quit IRC (Read error: Connection reset by peer) [22:47] that's a good idea, only problem with is it might result in multiple downloads of a site if it's referenced by many others [22:55] *** Start has quit IRC (Quit: Disconnected.) [22:56] *** Start-mob has joined #webroasting [23:03] *** chfoo has joined #webroasting [23:28] *** Start-mob has quit IRC (Quit: Leaving) [23:50] *** Start has joined #webroasting [23:50] *** svchfoo3 sets mode: +o Start