Time | Nickname | Message |
00:10 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
00:11 | | nekomune has quit IRC (Ping timeout: 268 seconds) |
00:13 | | philpem has quit IRC (Ping timeout: 252 seconds) |
00:14 | | nekomune has joined #archiveteam |
00:17 | | Laverne has quit IRC (Read error: Operation timed out) |
00:21 | | Laverne has joined #archiveteam |
00:26 | | zenguy_pc has joined #archiveteam |
00:42 | | ete has quit IRC (Read error: Connection reset by peer) |
00:42 | | JesseW has joined #archiveteam |
00:48 | | kb9mwr has joined #archiveteam |
00:50 | kb9mwr | hello |
00:59 | | phiren has quit IRC (Ping timeout: 506 seconds) |
00:59 | | xk_id has quit IRC (Remote host closed the connection) |
01:00 | | kb9mwr has quit IRC (Quit: Page closed) |
01:10 | | phiren has joined #archiveteam |
01:36 | | primus104 has quit IRC (Leaving.) |
01:37 | | JesseW has quit IRC (Read error: Operation timed out) |
01:37 | | Silvan has quit IRC (Read error: Operation timed out) |
01:41 | | SilSte has joined #archiveteam |
01:47 | | JesseW has joined #archiveteam |
02:15 | | nertzy has quit IRC (Quit: This computer has gone to sleep) |
02:15 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
02:26 | | zenguy_pc has joined #archiveteam |
02:39 | | nertzy has joined #archiveteam |
03:07 | pokeball9 | SketchCow: we might need to start archiving imageshack: http://puu.sh/kPr0L/5a38e9aa47.png |
03:07 | pokeball9 | If they're doing this, then they will likely start deleting images after a time |
03:08 | pikhq | pokeball9: Though concerning, that's not exactly an indication they're going to go ahead and start deleting stuff. Good to keep an eye out for though. |
03:22 | | nertzy has quit IRC (Quit: This computer has gone to sleep) |
03:33 | aaaaaaaaa | way to shoot yourself in the foot, again, imageshack |
03:34 | | JesseW has quit IRC (Read error: Operation timed out) |
03:36 | | JesseW has joined #archiveteam |
03:41 | pokeball9 | Again, aaaaaaaaa? |
03:42 | aaaaaaaaa | the 1 year rule, amongst others |
03:57 | | Ungstein has joined #archiveteam |
04:12 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
04:21 | | aaaaaaaaa has quit IRC (Leaving) |
04:22 | | JesseW has quit IRC (Read error: Operation timed out) |
04:27 | | zenguy_pc has joined #archiveteam |
06:11 | | Dark_Star has quit IRC (Ping timeout: 360 seconds) |
06:16 | | WinterFox has joined #archiveteam |
06:17 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
06:26 | | zenguy_pc has joined #archiveteam |
07:07 | | anomie has quit IRC (Read error: Connection reset by peer) |
07:14 | | anomie has joined #archiveteam |
07:17 | | primus104 has joined #archiveteam |
07:33 | | godane has quit IRC (Quit: Leaving.) |
07:53 | | atomotic has joined #archiveteam |
07:59 | WubTheCap | g.zxq.co is (now) deleting user uploaded content at will https://git.pantsu.cat/WubTheCaptain/deathwatch-pomf#gzxqco |
08:10 | | primus104 has quit IRC (Leaving.) |
08:21 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
08:26 | | primus104 has joined #archiveteam |
08:26 | | zenguy_pc has joined #archiveteam |
08:35 | | Ungstein has quit IRC (Ping timeout: 252 seconds) |
08:49 | | Ungstein has joined #archiveteam |
08:50 | | xk_id has joined #archiveteam |
08:59 | | jspiros has joined #archiveteam |
09:06 | | godane has joined #archiveteam |
09:30 | | GLaDOS has quit IRC (Ping timeout: 252 seconds) |
09:31 | | GLaDOS has joined #archiveteam |
09:37 | | godane has quit IRC (Quit: Leaving.) |
09:59 | | bzc6p__ has joined #archiveteam |
09:59 | | swebb sets mode: +o bzc6p__ |
10:06 | | bzc6p_ has quit IRC (Ping timeout: 615 seconds) |
10:16 | | VADemon has joined #archiveteam |
10:19 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
10:20 | | pokeball9 has quit IRC (Quit: Connection closed for inactivity) |
10:23 | | xk_id has quit IRC (Remote host closed the connection) |
10:24 | | xk_id has joined #archiveteam |
10:26 | | Ravenloft has quit IRC (Remote host closed the connection) |
10:29 | | zenguy_pc has joined #archiveteam |
10:32 | | schbirid has joined #archiveteam |
10:32 | | xk_id has quit IRC (Read error: Operation timed out) |
10:38 | | primus104 has quit IRC (Leaving.) |
10:43 | Yiffiel_d | Anyone enjoy a good bit of porn with their archiving? |
10:44 | Yiffiel_d | Lengthy project available: http://blog.livedoor.jp/insidears/archives/52712738.html |
10:44 | Yiffiel_d | https://www.lewdgamer.com/2015/10/17/legendary-eroge-developer-elf-closes-doors/ |
10:44 | Yiffiel_d | oh, also must know moonrunes |
10:45 | Yiffiel_d | man, 27 years. Almost as old as Nintendo! |
11:07 | | atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
11:07 | | trs80 has quit IRC (Ping timeout: 186 seconds) |
11:07 | | xk_id has joined #archiveteam |
11:23 | | primus104 has joined #archiveteam |
11:25 | | midas1 is now known as midas |
11:43 | | pokeball9 has joined #archiveteam |
12:15 | | WinterFox has quit IRC (Remote host closed the connection) |
12:19 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
12:19 | | atomotic has joined #archiveteam |
12:23 | | Atluxity has joined #archiveteam |
12:23 | Atluxity | would any of you happen to know if there is a way to query what domain names the Wayback Machine knows about? |
12:24 | Atluxity | and then in the end use that to get a complete list of "every" domain under a certain TLD |
12:24 | | Kagee has joined #archiveteam |
12:24 | joepie91 | Atluxity: I don't believe so: https://archive.org/help/wayback_api.php |
12:29 | WubTheCap | Atluxity: http://jordan-wright.com/blog/2015/09/30/how-to-download-a-list-of-all-registered-domain-names/ |
12:29 | WubTheCap | Was featured recently on HN |
12:29 | | zenguy_pc has joined #archiveteam |
12:30 | Atluxity | WubTheCap: thanks! |
12:30 | Atluxity | ah, yes |
12:30 | Atluxity | I saw that |
12:31 | Atluxity | but this does not cover my TLD of preference |
12:31 | | vitzli has joined #archiveteam |
12:31 | Atluxity | they do not want to publish any zones |
12:31 | Atluxity | they regard it as their copyrighted property |
12:41 | | BlueMaxim has quit IRC (Quit: Leaving) |
12:54 | | scyther has joined #archiveteam |
13:41 | | primus104 has quit IRC (Leaving.) |
13:57 | | Atom__ has joined #archiveteam |
14:19 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
14:23 | vitzli | Two weeks ago Yandex removed the old interface on its Kinopoisk.ru website (and butchered some of the features and user-created reviews), which led to a user protest, and some time later they brought the old interface back, but Yandex says that it is a temporary solution. |
14:24 | vitzli | http://www.ewdn.com/2015/10/13/yandex-backpedals-on-new-version-of-movie-platform-after-user-fury/ |
14:26 | vitzli | Though I will try to grab it with wget, may I ask for its archival by archiveteam? |
14:27 | | Atom-- has joined #archiveteam |
14:29 | vitzli | aaand they have bot protection, :-| |
14:30 | vitzli | "Too many requests from your IP address" |
14:30 | | zenguy_pc has joined #archiveteam |
14:30 | | pokeball9 has quit IRC (Quit: Connection closed for inactivity) |
14:34 | | Atom__ has quit IRC (Ping timeout: 506 seconds) |
14:47 | | anomie has quit IRC (Read error: Connection reset by peer) |
14:51 | | anomie has joined #archiveteam |
14:54 | | vtyl has joined #archiveteam |
14:54 | | lytv has quit IRC (Read error: Operation timed out) |
15:02 | | atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
15:11 | | Jogie has quit IRC (ZNC - http://znc.in) |
15:11 | | Jogie has joined #archiveteam |
15:14 | | vitzli has quit IRC (Quit: Leaving) |
15:17 | | scyther has quit IRC (Quit: Leaving) |
15:22 | VADemon | Yandex will not kill Kinopoisk |
15:24 | | jspiros has quit IRC (Ping timeout: 186 seconds) |
15:31 | | pokeball9 has joined #archiveteam |
15:32 | | jspiros has joined #archiveteam |
15:36 | | jspiros has quit IRC (Ping timeout: 186 seconds) |
15:39 | | JesseW has joined #archiveteam |
15:43 | | xk_id has quit IRC (Remote host closed the connection) |
15:49 | | jspiros has joined #archiveteam |
15:53 | | jspiros has quit IRC (Ping timeout: 186 seconds) |
15:55 | | jspiros has joined #archiveteam |
15:55 | | JesseW has quit IRC (Read error: Operation timed out) |
15:58 | | primus104 has joined #archiveteam |
16:08 | SketchCow | I assume Yiffiel_d was joking about "as old as Nintendo" |
16:20 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
16:21 | | xk_id has joined #archiveteam |
16:29 | | zenguy_pc has joined #archiveteam |
16:38 | | jspiros has quit IRC (Ping timeout: 186 seconds) |
16:44 | | Dark_Star has joined #archiveteam |
16:46 | | jspiros has joined #archiveteam |
16:51 | | scyther has joined #archiveteam |
17:04 | | godane has joined #archiveteam |
17:50 | | insane_al has joined #archiveteam |
18:17 | | jspiros has quit IRC (Ping timeout: 186 seconds) |
18:18 | | patrickod has joined #archiveteam |
18:20 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
18:24 | | pgoetz has joined #archiveteam |
18:31 | | zenguy_pc has joined #archiveteam |
18:39 | | PurpleSym has joined #archiveteam |
18:40 | | bzc6p__ is now known as bzc6p |
18:56 | | PurpleSym has quit IRC (WeeChat 1.1.1) |
19:11 | | aaaaaaaaa has joined #archiveteam |
19:11 | | swebb sets mode: +o aaaaaaaaa |
19:15 | | insane_al has quit IRC (Read error: Operation timed out) |
19:15 | bzc6p | joepie91, Atluxity: it seems to be actually possible. |
19:16 | bzc6p | The CDX server seems to support querying in *.tld format |
19:17 | bzc6p | e.g. I've tried |
19:17 | bzc6p | wget -O list "http://web.archive.org/cdx/search/cdx?url=*.hu&fl=original" |
19:17 | bzc6p | and it seems to work. And after that, |
19:17 | bzc6p | cut -d"/" -f 3 list | cut -d":" -f 1 | sort | uniq > domainlist |
19:18 | bzc6p | Finally, you can do some regex replace magic if you want second level domains only. |
19:18 | bzc6p | (I don't know how much this method counts as abusing the database, though.) |
19:19 | bzc6p | More info on the CDX server: https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server |
19:19 | Kagee | bzc6p: yep, that's what we found out :) |
19:20 | Kagee | just 50K queries (estimated with showNumPages) |
19:20 | Atluxity | ty bzc6p |
19:21 | bzc6p | Also, it may contain a lot of invalid URLs, apparently. |
19:22 | Kagee | yep :/ |
19:36 | | Zebranky_ is now known as Zebranky |
19:56 | ersi | arkiver: CheckIP() successfully finds my IP to be Swedish and tells me I'm banned |
19:59 | | schbirid has quit IRC (Quit: Leaving) |
20:16 | | Ghost_of_ has joined #archiveteam |
20:20 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
20:23 | | scyther has quit IRC (Quit: Leaving) |
20:25 | | xk_id has quit IRC (Read error: Connection reset by peer) |
20:26 | | xk_id has joined #archiveteam |
20:32 | | zenguy_pc has joined #archiveteam |
20:37 | | db48x has quit IRC (Remote host closed the connection) |
20:47 | | aaaaaaaa_ has joined #archiveteam |
20:47 | | aaaaaaaaa has quit IRC (Read error: Connection reset by peer) |
20:47 | | swebb sets mode: +o aaaaaaaa_ |
20:47 | | aaaaaaaa_ is now known as aaaaaaaaa |
21:04 | | Emcy has joined #archiveteam |
21:11 | | Emcy_ has quit IRC (Read error: Operation timed out) |
21:21 | arkiver | ersi: thank you for testing! |
21:25 | | godane has quit IRC (Ping timeout: 310 seconds) |
21:27 | | godane has joined #archiveteam |
21:36 | | chazchaz_ has joined #archiveteam |
21:36 | | chazchaz_ has quit IRC (Remote host closed the connection) |
21:40 | | chazchaz_ has joined #archiveteam |
21:41 | | chazchaz_ has quit IRC (Remote host closed the connection) |
21:47 | | db48x has joined #archiveteam |
21:55 | | chazchaz_ has joined #archiveteam |
21:57 | | chazchaz_ has quit IRC (Remote host closed the connection) |
21:58 | | pgoetz has quit IRC (Remote host closed the connection) |
22:12 | | bzc6p_ has joined #archiveteam |
22:12 | | swebb sets mode: +o bzc6p_ |
22:12 | | user2 has joined #archiveteam |
22:18 | | bzc6p has quit IRC (Ping timeout: 615 seconds) |
22:22 | | zenguy_pc has quit IRC (Read error: Operation timed out) |
22:33 | | zenguy_pc has joined #archiveteam |
22:33 | | dashcloud has quit IRC (Read error: Operation timed out) |
22:40 | | dashcloud has joined #archiveteam |
22:48 | | xk_id has quit IRC (Remote host closed the connection) |
23:19 | | user2 is now known as acchan |
23:45 | | Emcy has quit IRC (Read error: Connection reset by peer) |
23:51 | | dashcloud has quit IRC (Read error: Operation timed out) |
23:51 | | chazchaz_ has joined #archiveteam |
23:54 | | dashcloud has joined #archiveteam |
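
For anyone wanting to try the CDX approach bzc6p describes at 19:15 to 19:22, here is a rough Python sketch of the same two steps: building the `*.tld` query URL, then reducing the returned `original` URLs to a sorted, de-duplicated host list (the `cut | cut | sort | uniq` pipeline). The endpoint and the `url` and `fl` parameters come from the log; the function names `build_query`, `hostnames`, and `second_level` are made up for illustration, and the "second level" step is a naive last-two-labels guess rather than the regex bzc6p had in mind. It does no network access itself, and it skips lines that do not parse as URLs, since the results reportedly contain a lot of invalid ones.

```python
from urllib.parse import urlencode, urlsplit

# Endpoint as quoted in the log; treat it as an assumption if it has moved since.
CDX_ENDPOINT = "http://web.archive.org/cdx/search/cdx"


def build_query(tld):
    """Build the '*.tld' query URL used in the wget example (hypothetical helper)."""
    params = {"url": "*." + tld, "fl": "original"}
    # safe="*" keeps the wildcard readable instead of percent-encoding it.
    return CDX_ENDPOINT + "?" + urlencode(params, safe="*")


def hostnames(lines):
    """Rough equivalent of: cut -d"/" -f 3 | cut -d":" -f 1 | sort | uniq."""
    hosts = set()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            host = urlsplit(line).hostname  # lowercased, port stripped
        except ValueError:
            continue  # malformed URL, e.g. a broken IPv6 literal
        if host:
            hosts.add(host)
    return sorted(hosts)


def second_level(hosts):
    """Naive reduction to the last two labels; ignores multi-label suffixes."""
    return sorted({".".join(h.split(".")[-2:]) for h in hosts})
```

Typical use would be to download the output of `build_query("hu")` with wget as in the log, then pass the open file to `hostnames()`. Given Kagee's estimate of the result size, paging the query and pausing between requests would be the polite way to run it.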