Time |
Nickname |
Message |
00:15
🔗
|
|
Sk2d has joined #archiveteam |
00:16
🔗
|
|
BlueMax has joined #archiveteam |
00:18
🔗
|
|
Sk1d has quit IRC (Read error: Operation timed out) |
00:18
🔗
|
|
Sk2d is now known as Sk1d |
00:22
🔗
|
|
Ctrl has joined #archiveteam |
00:37
🔗
|
|
alfie has quit IRC (Quit: Bye.) |
00:59
🔗
|
|
fie has quit IRC (Ping timeout: 632 seconds) |
01:50
🔗
|
|
fie has joined #archiveteam |
02:18
🔗
|
|
balrog has quit IRC (Quit: Bye) |
02:19
🔗
|
|
Flashfire has joined #archiveteam |
02:19
🔗
|
Flashfire |
https://www.cordcuttersnews.com/sony-crackle-is-shutting-down-in-canada/ |
02:24
🔗
|
|
balrog has joined #archiveteam |
02:38
🔗
|
|
Flashfire has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) |
02:52
🔗
|
|
Darkstar has quit IRC (Ping timeout: 480 seconds) |
02:58
🔗
|
|
ta9le has quit IRC (Quit: Connection closed for inactivity) |
02:59
🔗
|
|
Darkstar has joined #archiveteam |
03:08
🔗
|
|
archodg_ has joined #archiveteam |
03:10
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
03:11
🔗
|
|
archodg has quit IRC (Read error: Operation timed out) |
03:24
🔗
|
|
odemg has joined #archiveteam |
03:26
🔗
|
|
pizzaiolo has quit IRC (Remote host closed the connection) |
03:39
🔗
|
|
Darkstar has quit IRC (Ping timeout: 480 seconds) |
03:45
🔗
|
|
Flashfire has joined #archiveteam |
03:46
🔗
|
|
Darkstar has joined #archiveteam |
03:55
🔗
|
|
BlueMax has quit IRC (Leaving) |
04:03
🔗
|
|
Flashfire has left |
04:31
🔗
|
|
Darkstar has quit IRC (Ping timeout: 480 seconds) |
04:32
🔗
|
|
BlueMax has joined #archiveteam |
04:43
🔗
|
|
Darkstar has joined #archiveteam |
05:16
🔗
|
|
Darkstar has quit IRC (Ping timeout: 246 seconds) |
05:22
🔗
|
|
Darkstar has joined #archiveteam |
06:04
🔗
|
|
tstarling has joined #archiveteam |
06:07
🔗
|
|
Darkstar has quit IRC (Ping timeout: 506 seconds) |
06:12
🔗
|
tstarling |
I want to talk about archiving https://arbital.com/ |
06:13
🔗
|
tstarling |
it uses XHR POST requests to get the article content, which is apparently why IA etc. has not been able to archive it successfully |
06:14
🔗
|
tstarling |
this website is a failed startup and the owner is in the process of winding it up, unclear what will happen |
06:15
🔗
|
|
Darkstar has joined #archiveteam |
06:15
🔗
|
tstarling |
I tried emailing him, no response |
06:25
🔗
|
tstarling |
I'm just trying to figure out what it would take to make an archive |
06:27
🔗
|
tstarling |
I could do a bit of dev work to support this but I'm not sure what I would need to patch at this point |
06:27
🔗
|
tstarling |
or whether it would be easier to set up a proxy website and crawl the proxy instead |
06:31
🔗
|
tstarling |
wrong time of the day for this channel, I guess, but maybe I'll carry on thinking out loud |
06:32
🔗
|
tstarling |
a proxy website could replace XMLHttpRequest with a version that converts POST parameters to query string parameters and sends them to a GET-to-POST gateway also on the proxy side |
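A minimal sketch of the GET-to-POST gateway tstarling describes, assuming the origin accepts form-encoded POST bodies (the real site may well expect JSON). The function name `get_to_post`, the `UPSTREAM` constant and the example path are hypothetical, not taken from arbital.com. It only builds the upstream request and does not send it.

```python
from urllib.parse import urlsplit, parse_qsl, urlencode
from urllib.request import Request

# Origin site named in the chat; everything else here is an assumed layout.
UPSTREAM = "https://arbital.com"


def get_to_post(gateway_url: str) -> Request:
    """Turn a crawlable GET URL into the POST request the origin expects.

    The query string of the GET URL becomes the POST body, and the path is
    reused unchanged on the upstream host.
    """
    parts = urlsplit(gateway_url)
    # Keep blank values so the POST body mirrors the query string exactly.
    params = parse_qsl(parts.query, keep_blank_values=True)
    body = urlencode(params).encode("utf-8")
    return Request(
        UPSTREAM + parts.path,
        data=body,
        method="POST",
        # Assumption: form encoding. Swap for JSON if the origin needs it.
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


if __name__ == "__main__":
    # Hypothetical gateway URL that a crawler could fetch with a plain GET.
    req = get_to_post("http://proxy.example/json/page/?alias=foo&n=1")
    print(req.get_method(), req.full_url, req.data)
```

The replaced XMLHttpRequest on the proxy side would do the inverse: serialize the POST parameters into the query string, so every article fetch becomes a distinct GET URL that a crawler can record.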
06:41
🔗
|
|
schbirid has joined #archiveteam |
06:53
🔗
|
|
Darkstar has quit IRC (Ping timeout: 246 seconds) |
06:58
🔗
|
tstarling |
but is that the best we can do? obviously a search of the wayback machine for arbital.com would still only give you blank pages |
06:58
🔗
|
tstarling |
you'd have to search for the proxy |
07:03
🔗
|
|
Darkstar has joined #archiveteam |
07:04
🔗
|
fenn |
tstarling: someone affiliated with arbital said just now that it's not shutting down, fwiw |
07:06
🔗
|
tstarling |
it's just one guy though, right? all he has to do is stop paying the Google App Engine bills |
07:06
🔗
|
tstarling |
bbl |
07:24
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
07:47
🔗
|
|
Darkstar has quit IRC (Ping timeout: 246 seconds) |
08:08
🔗
|
|
Darkstar has joined #archiveteam |
08:35
🔗
|
|
BlueMaxim has joined #archiveteam |
08:39
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
08:40
🔗
|
|
BlueMax has quit IRC (Read error: Operation timed out) |
08:41
🔗
|
|
BlueMaxim has quit IRC (Remote host closed the connection) |
08:41
🔗
|
|
BlueMax has joined #archiveteam |
08:53
🔗
|
|
redlob has joined #archiveteam |
09:34
🔗
|
|
ta9le has joined #archiveteam |
09:40
🔗
|
Muad-Dib |
I'm not sure we ever grabbed the "detailed" match statistics with #yolohalo, but they're going in 3 days https://www.bungie.net/en/Explore/Detail/News/46965 |
09:45
🔗
|
|
Aoede has quit IRC (Ping timeout: 252 seconds) |
09:47
🔗
|
|
Aoede has joined #archiveteam |
09:54
🔗
|
|
BlueMax has quit IRC (Leaving) |
10:56
🔗
|
|
zino__ has quit IRC (Remote host closed the connection) |
11:19
🔗
|
|
alfie has joined #archiveteam |
11:22
🔗
|
tstarling |
I guess there's a number of ways to spoof the contents of a website before putting it into a WARC, e.g. with an HTTP proxy or with /etc/hosts |
11:23
🔗
|
tstarling |
the question then would be whether that is allowable for ArchiveBot and IA |
11:25
🔗
|
tstarling |
but I am in the right place, right? You're the sort of people who (like me) care about websites disappearing? |
11:28
🔗
|
|
pizzaiolo has joined #archiveteam |
11:36
🔗
|
|
Stiletto has quit IRC (Read error: Connection reset by peer) |
12:07
🔗
|
Muad-Dib |
tstarling: this sounds like material for #archivebot-bs |
12:08
🔗
|
tstarling |
thanks Muad-Dib |
12:09
🔗
|
Muad-Dib |
np |
12:09
🔗
|
tstarling |
but better to try in the US day time by the looks of it? |
12:09
🔗
|
Muad-Dib |
depends, there are a lot of Europeans here too, but probably more Americans, yeah |
12:10
🔗
|
Muad-Dib |
but then again, many of us are a bit crazy and might have weird sleep schedules ;) |
12:12
🔗
|
tstarling |
I'll try in 10 hours or so, tomorrow morning for me |
12:12
🔗
|
|
tstarling has quit IRC (Quit: Leaving) |
12:30
🔗
|
|
Lactuca has joined #archiveteam |
12:33
🔗
|
|
Lactuca has quit IRC (Client Quit) |
13:27
🔗
|
phillipsj |
Using POST to access content is such an Internet no-no, I wonder if it is designed to foil crawlers. |
13:32
🔗
|
|
ZoeB_ has joined #archiveteam |
14:13
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
14:14
🔗
|
|
ancaster has joined #archiveteam |
14:15
🔗
|
|
Mateon1 has joined #archiveteam |
14:27
🔗
|
|
ancaster has quit IRC (Quit: Page closed) |
14:48
🔗
|
|
antomati_ has joined #archiveteam |
14:50
🔗
|
|
antomatic has quit IRC (Read error: Operation timed out) |
14:50
🔗
|
|
antomati_ is now known as antomatic |
15:38
🔗
|
|
MMovie has quit IRC (Read error: Connection reset by peer) |
17:04
🔗
|
|
MMovie has joined #archiveteam |
17:21
🔗
|
|
db48x has joined #archiveteam |
17:31
🔗
|
|
Ctrl has quit IRC (Ping timeout: 268 seconds) |
17:32
🔗
|
|
Aoede has quit IRC (Quit: ZNC - http://znc.in) |
17:40
🔗
|
|
Stilett0 has joined #archiveteam |
17:43
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
17:44
🔗
|
|
redlob has joined #archiveteam |
18:01
🔗
|
|
Ctrl has joined #archiveteam |
18:01
🔗
|
|
pizzaiolo has quit IRC (Read error: Operation timed out) |
18:09
🔗
|
|
adinbied has quit IRC (Quit: Leaving) |
18:36
🔗
|
|
schbirid has joined #archiveteam |
19:01
🔗
|
|
ta9le has quit IRC (Quit: Connection closed for inactivity) |
19:07
🔗
|
|
Aoede has joined #archiveteam |
19:14
🔗
|
|
ZoeB_ has quit IRC (Quit: ZoeB_) |
20:15
🔗
|
|
antomati_ has joined #archiveteam |
20:15
🔗
|
|
antomati_ has quit IRC (Connection closed) |
20:16
🔗
|
|
antomatic has quit IRC (Ping timeout: 252 seconds) |
20:34
🔗
|
|
antomatic has joined #archiveteam |
20:47
🔗
|
|
jschwart has joined #archiveteam |
20:55
🔗
|
|
schbirid has quit IRC (Remote host closed the connection) |
21:29
🔗
|
|
jschwart has quit IRC (Quit: Konversation terminated!) |
21:32
🔗
|
|
ta9le has joined #archiveteam |
22:08
🔗
|
|
Ctrl has quit IRC (Ping timeout: 268 seconds) |
22:09
🔗
|
|
Meroje has quit IRC (Ping timeout: 260 seconds) |
22:15
🔗
|
|
Meroje has joined #archiveteam |
22:36
🔗
|
|
Ctrl has joined #archiveteam |
23:26
🔗
|
|
BlueMax has joined #archiveteam |
23:54
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |