| Time |
Nickname |
Message |
|
00:15
🔗
|
|
Maylay has joined #archiveteam-ot |
|
00:30
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
|
00:37
🔗
|
|
wp494 has quit IRC (Read error: Connection reset by peer) |
|
00:38
🔗
|
|
wp494 has joined #archiveteam-ot |
|
01:03
🔗
|
|
maxfan8 has quit IRC (Quit: WeeChat 2.8) |
|
01:03
🔗
|
|
maxfan8 has joined #archiveteam-ot |
|
01:17
🔗
|
|
katocala has quit IRC () |
|
01:19
🔗
|
|
Arcorann has joined #archiveteam-ot |
|
02:48
🔗
|
|
t3 has quit IRC (Read error: Connection reset by peer) |
|
02:48
🔗
|
|
justcool3 has quit IRC (Read error: Connection reset by peer) |
|
02:48
🔗
|
|
Ctrl-S___ has quit IRC (Write error: Connection reset by peer) |
|
02:48
🔗
|
|
katocala has joined #archiveteam-ot |
|
02:50
🔗
|
|
t3 has joined #archiveteam-ot |
|
02:50
🔗
|
|
justcool3 has joined #archiveteam-ot |
|
02:53
🔗
|
|
Ctrl-S___ has joined #archiveteam-ot |
|
02:53
🔗
|
|
katocala has quit IRC (Client Quit) |
|
03:02
🔗
|
|
BlueMaxim has joined #archiveteam-ot |
|
03:04
🔗
|
|
katocala has joined #archiveteam-ot |
|
03:14
🔗
|
|
BlueMax has quit IRC (Ping timeout: 745 seconds) |
|
03:22
🔗
|
|
Arcorann has quit IRC (Read error: Connection reset by peer) |
|
03:42
🔗
|
|
qw3rty__ has joined #archiveteam-ot |
|
03:50
🔗
|
|
qw3rty_ has quit IRC (Read error: Operation timed out) |
|
04:03
🔗
|
|
benjins has quit IRC (Remote host closed the connection) |
|
04:05
🔗
|
|
benjins has joined #archiveteam-ot |
|
04:22
🔗
|
|
Arcorann has joined #archiveteam-ot |
|
05:20
🔗
|
|
BlueMax has joined #archiveteam-ot |
|
05:20
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
|
07:01
🔗
|
|
justcool3 has quit IRC (Quit: Connection closed for inactivity) |
|
07:01
🔗
|
|
godane has quit IRC (Read error: Connection reset by peer) |
|
07:20
🔗
|
|
godane has joined #archiveteam-ot |
|
07:50
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
|
07:53
🔗
|
VoynichCr |
speaking on small uncovered sites... how could we find small uncovered sites in an automated way? |
|
07:58
🔗
|
VoynichCr |
i could generate a list of official sites linked from wikidata |
|
07:59
🔗
|
VoynichCr |
and somebody a bot to estimate site size, last time update, CMS if any, etc |
|
07:59
🔗
|
VoynichCr |
and check WMB stats for that site, number of snapshots, etc |
|
09:32
🔗
|
OrIdow6 |
Wikidata is too narrow a source |
|
09:33
🔗
|
OrIdow6 |
I think you could combine it with everything you could get access to (outlinks from other Wikimedia things, those "lists of all domains" that float around, AB CDXs, etc.) |
|
09:33
🔗
|
OrIdow6 |
"You" in the generic |
|
09:46
🔗
|
VoynichCr |
i know, wikidata is just a curated sample of important domains, a first approach |
|
09:57
🔗
|
OrIdow6 |
Yes; I suppose how expansive you are depends on how much you scan |
|
09:57
🔗
|
OrIdow6 |
*how much you have the capacity to scan |
|
09:58
🔗
|
OrIdow6 |
I've thought that something like this (also looking for forums that have been overwhelmed by spam, etc.) would make a good "idle" warrior project alongside URLTeam |
|
11:14
🔗
|
|
BlueMax has joined #archiveteam-ot |
|
12:06
🔗
|
|
Maylay has quit IRC (Ping timeout: 265 seconds) |
|
12:18
🔗
|
|
Maylay has joined #archiveteam-ot |
|
12:58
🔗
|
|
jason0597 has joined #archiveteam-ot |
|
13:24
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
|
14:28
🔗
|
|
HP_Archiv has quit IRC (Read error: Connection reset by peer) |
|
15:28
🔗
|
|
Arcorann has quit IRC (Read error: Connection reset by peer) |
|
16:12
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
|
17:33
🔗
|
|
VerifiedJ has joined #archiveteam-ot |
|
17:45
🔗
|
Frogging |
JAA: who the hell is that |
|
17:46
🔗
|
JAA |
Frogging: joaquinit |
|
17:46
🔗
|
Frogging |
oh |
|
17:46
🔗
|
Frogging |
right |
|
17:48
🔗
|
Frogging |
weird troll |
|
18:28
🔗
|
|
britmob_ has quit IRC (Read error: Connection reset by peer) |
|
18:28
🔗
|
|
britmob has joined #archiveteam-ot |
|
19:26
🔗
|
|
HP_Archiv has joined #archiveteam-ot |
|
21:26
🔗
|
|
t3 has quit IRC (Quit: Connection closed for inactivity) |
|
21:27
🔗
|
|
VerifiedJ has quit IRC (Quit: Leaving) |
|
21:37
🔗
|
|
Raccoon has quit IRC (Ping timeout: 265 seconds) |
|
22:24
🔗
|
|
DigiDigi has quit IRC (Read error: Operation timed out) |
|
22:45
🔗
|
|
godane has quit IRC (Quit: Leaving.) |
|
22:51
🔗
|
|
DigiDigi has joined #archiveteam-ot |
|
23:13
🔗
|
Ryz |
...Interesting, seems to be sticking around; for some reason when clicking a search result in DuckDuckGo, the address bar quickly changes into have some garbled up stuff added onto it (probably tracking?) before you were able to go into said link |
|
23:13
🔗
|
Ryz |
It's been like that for at least a week now |
|
23:36
🔗
|
|
BlueMax has joined #archiveteam-ot |