Time |
Nickname |
Message |
00:15
🔗
|
|
Maylay has joined #archiveteam-ot |
00:30
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
00:37
🔗
|
|
wp494 has quit IRC (Read error: Connection reset by peer) |
00:38
🔗
|
|
wp494 has joined #archiveteam-ot |
01:03
🔗
|
|
maxfan8 has quit IRC (Quit: WeeChat 2.8) |
01:03
🔗
|
|
maxfan8 has joined #archiveteam-ot |
01:17
🔗
|
|
katocala has quit IRC () |
01:19
🔗
|
|
Arcorann has joined #archiveteam-ot |
02:48
🔗
|
|
t3 has quit IRC (Read error: Connection reset by peer) |
02:48
🔗
|
|
justcool3 has quit IRC (Read error: Connection reset by peer) |
02:48
🔗
|
|
Ctrl-S___ has quit IRC (Write error: Connection reset by peer) |
02:48
🔗
|
|
katocala has joined #archiveteam-ot |
02:50
🔗
|
|
t3 has joined #archiveteam-ot |
02:50
🔗
|
|
justcool3 has joined #archiveteam-ot |
02:53
🔗
|
|
Ctrl-S___ has joined #archiveteam-ot |
02:53
🔗
|
|
katocala has quit IRC (Client Quit) |
03:02
🔗
|
|
BlueMaxim has joined #archiveteam-ot |
03:04
🔗
|
|
katocala has joined #archiveteam-ot |
03:14
🔗
|
|
BlueMax has quit IRC (Ping timeout: 745 seconds) |
03:22
🔗
|
|
Arcorann has quit IRC (Read error: Connection reset by peer) |
03:42
🔗
|
|
qw3rty__ has joined #archiveteam-ot |
03:50
🔗
|
|
qw3rty_ has quit IRC (Read error: Operation timed out) |
04:03
🔗
|
|
benjins has quit IRC (Remote host closed the connection) |
04:05
🔗
|
|
benjins has joined #archiveteam-ot |
04:22
🔗
|
|
Arcorann has joined #archiveteam-ot |
05:20
🔗
|
|
BlueMax has joined #archiveteam-ot |
05:20
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
07:01
🔗
|
|
justcool3 has quit IRC (Quit: Connection closed for inactivity) |
07:01
🔗
|
|
godane has quit IRC (Read error: Connection reset by peer) |
07:20
🔗
|
|
godane has joined #archiveteam-ot |
07:50
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
07:53
🔗
|
VoynichCr |
speaking on small uncovered sites... how could we find small uncovered sites in an automated way? |
07:58
🔗
|
VoynichCr |
i could generate a list of official sites linked from wikidata |
07:59
🔗
|
VoynichCr |
and somebody a bot to estimate site size, last time update, CMS if any, etc |
07:59
🔗
|
VoynichCr |
and check WMB stats for that site, number of snapshots, etc |
09:32
🔗
|
OrIdow6 |
Wikidata is too narrow a source |
09:33
🔗
|
OrIdow6 |
I think you could combine it with everything you could get access to (outlinks from other Wikimedia things, those "lists of all domains" that float around, AB CDXs, etc.) |
09:33
🔗
|
OrIdow6 |
"You" in the generic |
09:46
🔗
|
VoynichCr |
i know, wikidata is just a curated sample of important domains, a first approach |
09:57
🔗
|
OrIdow6 |
Yes; I suppose how expansive you are depends on how much you scan |
09:57
🔗
|
OrIdow6 |
*how much you have the capacity to scan |
09:58
🔗
|
OrIdow6 |
I've thought that something like this (also looking for forums that have been overwhelmed by spam, etc.) would make a good "idle" warrior project alongside URLTeam |
11:14
🔗
|
|
BlueMax has joined #archiveteam-ot |
12:06
🔗
|
|
Maylay has quit IRC (Ping timeout: 265 seconds) |
12:18
🔗
|
|
Maylay has joined #archiveteam-ot |
12:58
🔗
|
|
jason0597 has joined #archiveteam-ot |
13:24
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
14:28
🔗
|
|
HP_Archiv has quit IRC (Read error: Connection reset by peer) |
15:28
🔗
|
|
Arcorann has quit IRC (Read error: Connection reset by peer) |
16:12
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
17:33
🔗
|
|
VerifiedJ has joined #archiveteam-ot |
17:45
🔗
|
Frogging |
JAA: who the hell is that |
17:46
🔗
|
JAA |
Frogging: joaquinit |
17:46
🔗
|
Frogging |
oh |
17:46
🔗
|
Frogging |
right |
17:48
🔗
|
Frogging |
weird troll |
18:28
🔗
|
|
britmob_ has quit IRC (Read error: Connection reset by peer) |
18:28
🔗
|
|
britmob has joined #archiveteam-ot |
19:26
🔗
|
|
HP_Archiv has joined #archiveteam-ot |
21:26
🔗
|
|
t3 has quit IRC (Quit: Connection closed for inactivity) |
21:27
🔗
|
|
VerifiedJ has quit IRC (Quit: Leaving) |
21:37
🔗
|
|
Raccoon has quit IRC (Ping timeout: 265 seconds) |
22:24
🔗
|
|
DigiDigi has quit IRC (Read error: Operation timed out) |
22:45
🔗
|
|
godane has quit IRC (Quit: Leaving.) |
22:51
🔗
|
|
DigiDigi has joined #archiveteam-ot |
23:13
🔗
|
Ryz |
...Interesting, seems to be sticking around; for some reason when clicking a search result in DuckDuckGo, the address bar quickly changes into have some garbled up stuff added onto it (probably tracking?) before you were able to go into said link |
23:13
🔗
|
Ryz |
It's been like that for at least a week now |
23:36
🔗
|
|
BlueMax has joined #archiveteam-ot |