| Time | Nickname | Message |
| --- | --- | --- |
| 00:41 | | Start_ has joined #webroasting |
| 00:41 | | Start has quit IRC (Read error: Connection reset by peer) |
| 00:50 | | Start_ is now known as Start |
| 00:50 | | svchfoo3 sets mode: +o Start |
| 08:10 | | wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES) |
| 08:24 | | wp494 has joined #webroasting |
| 08:24 | | svchfoo3 sets mode: +o wp494 |
| 15:41 | | Start has quit IRC (Quit: Disconnected.) |
| 18:29 | | Start has joined #webroasting |
| 19:19 | | Start has quit IRC (Quit: Disconnected.) |
| 19:36 | | Start has joined #webroasting |
| 19:59 | | Ctrl-S___ has quit IRC (Write error: Broken pipe) |
| 19:59 | | _desu___ has quit IRC (Write error: Broken pipe) |
| 20:10 | | Start_ has joined #webroasting |
| 20:10 | | Start has quit IRC (Read error: Connection reset by peer) |
| 20:11 | | chfoo has quit IRC (Read error: Operation timed out) |
| 20:26 | | chfoo has joined #webroasting |
| 20:40 | | _desu___ has joined #webroasting |
| 20:45 | | Ctrl-S___ has joined #webroasting |
| 20:45 | | Start_ has quit IRC (Quit: Disconnected.) |
| 20:46 | | svchfoo3 sets mode: +o Ctrl-S___ |
| 20:49 | | Start has joined #webroasting |
| 22:16 | | arkiver has joined #webroasting |
| 22:17 | arkiver | hi |
| 22:19 | SimpBrain | isp web hosting is like the next gen after geocities |
| 22:20 | SimpBrain | some sites like geocities styling, others are good :) |
| 22:23 | Start | we should probably start with hosts that publicly offer lists of their sites like chebucto: http://www.chebucto.ns.ca/subject.shtml |
| 22:39 | Start | for web hosts without public lists we'll just scrape google & other sites |
| 22:40 | Start | then as we download the sites we can scrape their html to discover more |
| 22:41 | arkiver | Or I can allow the scripts to also download other sites hosted by the same company that they find while archiving the websites |
| 22:46 | | chfoo has quit IRC (Read error: Connection reset by peer) |
| 22:47 | Start | that's a good idea, only problem with it is it might result in multiple downloads of a site if it's referenced by many others |
| 22:55 | | Start has quit IRC (Quit: Disconnected.) |
| 22:56 | | Start-mob has joined #webroasting |
| 23:03 | | chfoo has joined #webroasting |
| 23:28 | | Start-mob has quit IRC (Quit: Leaving) |
| 23:50 | | Start has joined #webroasting |
| 23:50 | | svchfoo3 sets mode: +o Start |
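
The discovery idea discussed at 22:40 to 22:47 (scrape downloaded HTML for more sites on the same host, without downloading a site twice when many pages link to it) could be sketched roughly as below. This is only an illustration, not the scripts arkiver mentions: `LinkCollector`, `site_key`, and `Frontier` are hypothetical names, `www.example.org` is a placeholder host, and the sketch assumes each hosted site lives under the first path segment (for example `/~user/`), which will not hold for every ISP.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def site_key(url, host):
    """Return a canonical key for a site on `host`, or None if off-host.

    Assumption: a site is identified by the first path segment, so
    /~alice/ and /~alice/pics.html map to the same key. Top-level
    files on the host would also get a key under this simple rule.
    """
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        return None
    if (parts.hostname or "").lower() != host.lower():
        return None
    segments = [s for s in parts.path.split("/") if s]
    if not segments:
        return None
    return f"{host.lower()}/{segments[0]}"


class Frontier:
    """Queue of sites to download; each site is queued at most once."""

    def __init__(self, host):
        self.host = host
        self.seen = set()
        self.queue = deque()

    def add(self, url):
        """Queue the site containing `url`; return True if it was new."""
        key = site_key(url, self.host)
        if key is None or key in self.seen:
            return False
        self.seen.add(key)
        self.queue.append(key)
        return True

    def add_links_from(self, page_url, html):
        """Scan one downloaded page and queue newly discovered sites."""
        parser = LinkCollector()
        parser.feed(html)
        return sum(self.add(urljoin(page_url, href)) for href in parser.hrefs)
```

Seeding a `Frontier` from a public list such as the chebucto index and then calling `add_links_from` on each downloaded page would cover both discovery routes from the discussion; the `seen` set is what addresses the multiple-download concern raised at 22:47.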