Time |
Nickname |
Message |
00:10
🔗
|
|
furrie has quit IRC (Quit: Page closed) |
00:17
🔗
|
|
furrie has joined #archiveteam-bs |
00:18
🔗
|
furrie |
yeah, hard work done by others should be archived |
00:18
🔗
|
furrie |
too bad some of it got lost in the rapidly-changing internet |
00:19
🔗
|
furrie |
i wish the archiveteam had a program that could make the lost sites reappear in thin air on to the Wayback Machine, complete. |
00:23
🔗
|
furrie |
i figure we should start archiving educational stuff |
00:33
🔗
|
furrie |
xmc: wow, i'm checking for random sonic stuff in the wayback machine, and it's been all recently added in those days I added all of those sonic sites in the archiveBot. |
00:39
🔗
|
|
furrie has quit IRC (Quit: Page closed) |
00:39
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
00:51
🔗
|
|
GLaDOS has quit IRC (Read error: Operation timed out) |
00:53
🔗
|
|
GLaDOS has joined #archiveteam-bs |
00:59
🔗
|
|
GLaDOS has quit IRC (Ping timeout: 252 seconds) |
01:08
🔗
|
yipdw |
so someone made a Markov chain with Spring Framework class names |
01:08
🔗
|
yipdw |
should repeat that with Core Media function names |
01:09
🔗
|
yipdw |
I mean this is a function I am actually reading about now: CMMetadataFormatDescriptionCreateWithMetadataFormatDescriptionAndMetadataSpecifications |
01:11
🔗
|
yipdw |
that said I guess it doesn't compare to -[MTLBlitCommandEncoder copyFromTexture:sourceSlice:sourceLevel:sourceOrigin:sourceSize:toBuffer:destinationOffset:destinationBytesPerRow:destinationBytesPerImage:] |
01:17
🔗
|
|
dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) |
01:18
🔗
|
|
dashcloud has joined #archiveteam-bs |
01:18
🔗
|
|
JesseW has joined #archiveteam-bs |
01:33
🔗
|
pikhq |
SketchCow: Still wanting someone to test Internet Arcade on Windows 10 & MS Edge? |
01:33
🔗
|
pikhq |
MS Edge is (of course) not the browser I regularly use, but I do have a Windows 10 system right here. :) |
01:38
🔗
|
pikhq |
... Well, I believe it's moot because I currently can't access archive.org. |
01:39
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
01:44
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
01:47
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:48
🔗
|
|
JesseW has joined #archiveteam-bs |
01:50
🔗
|
|
dashcloud has joined #archiveteam-bs |
01:52
🔗
|
|
schbirid has joined #archiveteam-bs |
02:02
🔗
|
JesseW |
What is the process between WARCs being uploaded to IA and them showing up in the Wayback Machine, anyway? Is it manual, semi-manual, or? What is the typical time scale? Is there an easy way to check? |
02:03
🔗
|
DFJustin |
it's automatic and should be within a day or so |
02:03
🔗
|
DFJustin |
easy way to check is http://web.archive.org/*/http://www.example.com/ |
02:03
🔗
|
|
yipdw has quit IRC (Remote host closed the connection) |
02:05
🔗
|
|
yipdw has joined #archiveteam-bs |
02:07
🔗
|
JesseW |
I meant, for a given item (containing a MegaWARC with 1000s of pages) a way to quickly make sure they have been included... |
02:07
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
02:08
🔗
|
JesseW |
For that matter, I'm not (yet) sure how to identify the list of pages in a megawarc item... |
02:11
🔗
|
yipdw |
you can scan the generated cdx |
02:12
🔗
|
|
kyan has joined #archiveteam-bs |
02:12
🔗
|
JesseW |
Is archive.org down for other people? https://archive.org/details/archiveteam_zapd is unresponsive for me... |
02:13
🔗
|
aaaaaaaaa |
yep, been that way for awhile |
02:13
🔗
|
JesseW |
hm -- not status updates, I presume. |
02:13
🔗
|
JesseW |
er, s/not/no/ |
02:16
🔗
|
|
kyanz`bot has joined #archiveteam-bs |
02:16
🔗
|
yipdw |
works for me |
02:16
🔗
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
02:17
🔗
|
aaaaaaaaa |
does for me again |
02:17
🔗
|
dxrt |
back |
02:17
🔗
|
JesseW |
interesting. |
02:17
🔗
|
JesseW |
(works for me, also) |
02:17
🔗
|
pikhq |
Yay? |
02:18
🔗
|
JesseW |
is there a offsite status page for IA? |
02:20
🔗
|
kyan |
JesseW, I think they sometimes tweet at https://twitter.com/@internetarchive |
02:21
🔗
|
JesseW |
kyan: thanks -- checked there, no mention of the current problems |
02:21
🔗
|
kyan |
yeah i didn't see one either :P |
02:23
🔗
|
JesseW |
is there a "official" IA IRC channel? |
02:23
🔗
|
aaaaaaaaa |
well, it is 7:20 PDT on a friday |
02:23
🔗
|
aaaaaaaaa |
depending on the problem there may not be any time to tweet |
02:24
🔗
|
kyan |
there's #internetarchive (EFnet) |
02:24
🔗
|
kyan |
but as it says in the topic, it's unofficial |
02:25
🔗
|
JesseW |
I see. |
02:26
🔗
|
* |
JesseW really needs to hack my copy of TsLogBot.py to support changing the list of channels logged without a hard reboot... |
02:28
🔗
|
JesseW |
So, about CDX files -- what is the .cdx.idx (as opposed to .cdx.gz)? The _files.xml file says it is a "Item CDX Meta-Index"... |
02:29
🔗
|
|
primus104 has quit IRC (Leaving.) |
02:29
🔗
|
JesseW |
I'm looking at https://archive.org/download/archiveteam_zapd_20131029051118 |
02:30
🔗
|
yipdw |
you want the megawarc CDX |
02:31
🔗
|
yipdw |
e.g |
02:31
🔗
|
yipdw |
curl -sL 'https://archive.org/download/archiveteam_zapd_20131016071259/zapd_20131016071259.megawarc.warc.os.cdx.gz' | gunzip -c | cut -f3 -d' ' |
02:31
🔗
|
yipdw |
that'll list all URLs, in URL order, in that megawarc |
02:33
🔗
|
JesseW |
nice. |
02:33
🔗
|
* |
JesseW goes to add that to the wiki... |
02:34
🔗
|
yipdw |
that said there are better ways |
02:35
🔗
|
yipdw |
Wayback, for example |
02:36
🔗
|
yipdw |
the os.cdx.gz IIRC is a megawarc-specific thing |
02:37
🔗
|
yipdw |
but that pipeline should work for any WARC CDX you find |
02:38
🔗
|
yipdw |
the rest of the cdx record is useful as well; for example, if you want to know what WARC to look in, you can get the filename, offset, and size from the record also |
02:38
🔗
|
yipdw |
in the case of megawarc you will get the original WARC name, offset, and size |
02:49
🔗
|
JesseW |
added to http://archiveteam.org/index.php?title=The_WARC_Ecosystem#CDX_File_Format |
02:53
🔗
|
|
JesseW1 has joined #archiveteam-bs |
02:55
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
03:02
🔗
|
|
GLaDOS has joined #archiveteam-bs |
03:22
🔗
|
|
GLaDOS has quit IRC (Ping timeout: 252 seconds) |
03:25
🔗
|
|
GLaDOS has joined #archiveteam-bs |
04:06
🔗
|
|
kyan has quit IRC (Read error: Connection reset by peer) |
04:06
🔗
|
|
kyan has joined #archiveteam-bs |
04:22
🔗
|
|
mistym has joined #archiveteam-bs |
04:25
🔗
|
|
kyanz`bot has quit IRC (Ping timeout: 1221 seconds) |
04:30
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
05:06
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
05:06
🔗
|
|
dashcloud has joined #archiveteam-bs |
05:23
🔗
|
|
xtr-201 has quit IRC (Read error: Operation timed out) |
05:28
🔗
|
|
BlueMaxim has quit IRC (Ping timeout: 306 seconds) |
05:28
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
05:30
🔗
|
|
dashcloud has quit IRC (Quit: No Ping reply in 210 seconds.) |
05:35
🔗
|
|
dashcloud has joined #archiveteam-bs |
05:44
🔗
|
|
ewrerwt has joined #archiveteam-bs |
05:44
🔗
|
|
ewrerwt is now known as skrp |
05:44
🔗
|
|
skrp is now known as rejk |
05:51
🔗
|
rejk |
http://imagebin.ca/v/29uYWbJ6ijEn |
05:51
🔗
|
rejk |
freebsd on all three file servers: ntfs ufs zfs |
05:54
🔗
|
rejk |
100TB archive of traffic information. funny things happen in internet traffic. you get traffic like 56.2GB The Master Collection of How To Date Women. |
05:54
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
06:15
🔗
|
|
mistym has joined #archiveteam-bs |
06:32
🔗
|
|
Sanqui is now known as Sanqui|go |
06:58
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
07:00
🔗
|
|
mistym has joined #archiveteam-bs |
07:17
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
07:33
🔗
|
|
dashcloud has joined #archiveteam-bs |
07:51
🔗
|
|
JesseW1 has quit IRC (Quit: Leaving.) |
08:12
🔗
|
|
rejk has quit IRC (Remote host closed the connection) |
08:17
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
08:26
🔗
|
|
mistym has joined #archiveteam-bs |
08:30
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
08:31
🔗
|
Ctrl-S |
do you guys archive the various linux distros, microsoft patches/releases/etc and other things like that? |
08:33
🔗
|
|
mistym has joined #archiveteam-bs |
08:42
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
08:51
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
08:58
🔗
|
|
dashcloud has joined #archiveteam-bs |
09:21
🔗
|
|
primus104 has joined #archiveteam-bs |
09:41
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
09:42
🔗
|
|
mistym has joined #archiveteam-bs |
09:44
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
09:46
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
09:49
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
09:50
🔗
|
|
dashcloud has joined #archiveteam-bs |
10:08
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
10:14
🔗
|
|
dashcloud has joined #archiveteam-bs |
10:26
🔗
|
joepie91_ |
Ctrl-S: not on a regular basis, I think |
10:26
🔗
|
joepie91_ |
but occasionally, yes |
10:26
🔗
|
joepie91_ |
you're welcome to change that, ofc ;) |
10:28
🔗
|
Ctrl-S |
it seems like the kind of thing thaw would be essential to keep archived somewhere |
10:28
🔗
|
|
joepie91_ is now known as joepie91 |
10:48
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
12:32
🔗
|
|
vitzli has joined #archiveteam-bs |
12:33
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
12:45
🔗
|
|
schbirid has joined #archiveteam-bs |
12:56
🔗
|
|
Ravenloft has quit IRC (Remote host closed the connection) |
13:25
🔗
|
|
godane has joined #archiveteam-bs |
13:27
🔗
|
godane |
i'm back |
13:27
🔗
|
godane |
i'm on my new system |
13:28
🔗
|
godane |
i found a work around to boot into my ext4 partition even when UEFI doesn't see it |
13:39
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
13:54
🔗
|
|
godane1 has joined #archiveteam-bs |
13:55
🔗
|
|
godane has quit IRC (Read error: Operation timed out) |
13:56
🔗
|
|
xtr-201 has joined #archiveteam-bs |
14:09
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
14:15
🔗
|
|
xtr-201 has quit IRC (Read error: Operation timed out) |
14:16
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
14:31
🔗
|
|
xtr-201 has joined #archiveteam-bs |
14:31
🔗
|
|
dashcloud has joined #archiveteam-bs |
14:33
🔗
|
|
godane1 has quit IRC (Ping timeout: 306 seconds) |
14:40
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
14:42
🔗
|
|
godane has joined #archiveteam-bs |
14:56
🔗
|
|
kyan has quit IRC (Quit: This computer has gone to sleep) |
14:57
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
15:03
🔗
|
|
dashcloud has joined #archiveteam-bs |
15:04
🔗
|
|
primus104 has quit IRC (Leaving.) |
15:09
🔗
|
|
SadDM has quit IRC (Ping timeout: 483 seconds) |
15:12
🔗
|
|
SadDM has joined #archiveteam-bs |
15:20
🔗
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
15:30
🔗
|
|
SimpBrain has joined #archiveteam-bs |
15:36
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
15:40
🔗
|
|
dashcloud has joined #archiveteam-bs |
15:56
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
16:03
🔗
|
|
SimpBrain has quit IRC (Quit: Leaving) |
16:04
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
16:06
🔗
|
|
SimpBrain has joined #archiveteam-bs |
16:24
🔗
|
|
JesseW has joined #archiveteam-bs |
16:27
🔗
|
|
JesseW has left |
16:29
🔗
|
|
Kirk has joined #archiveteam-bs |
16:29
🔗
|
|
raylee has joined #archiveteam-bs |
16:29
🔗
|
|
wm_ has joined #archiveteam-bs |
16:36
🔗
|
|
raylee is now known as Rye |
16:43
🔗
|
|
mistym has joined #archiveteam-bs |
16:45
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
17:18
🔗
|
|
godane has quit IRC (Quit: Leaving.) |
17:20
🔗
|
|
Ravenloft has joined #archiveteam-bs |
17:34
🔗
|
|
aaaaaaaaa has joined #archiveteam-bs |
17:36
🔗
|
|
dashcloud has quit IRC (Ping timeout: 483 seconds) |
17:42
🔗
|
|
dashcloud has joined #archiveteam-bs |
17:45
🔗
|
|
primus104 has joined #archiveteam-bs |
18:03
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
18:03
🔗
|
|
mistym has quit IRC (Ping timeout: 252 seconds) |
18:06
🔗
|
|
godane has joined #archiveteam-bs |
18:06
🔗
|
|
mistym has joined #archiveteam-bs |
18:07
🔗
|
godane |
looks like we could grab speedtest.net |
18:08
🔗
|
godane |
it goes as far back as march 2007: http://www.speedtest.net/my-result/106000000 |
18:10
🔗
|
|
aaaaaaaaa has joined #archiveteam-bs |
18:11
🔗
|
midas |
thats cool godane :D |
18:15
🔗
|
godane |
i figure it could be useful data |
18:16
🔗
|
godane |
we can then look at data by isp over time based on location |
18:20
🔗
|
|
primus104 has quit IRC (Read error: Connection reset by peer) |
18:26
🔗
|
|
primus104 has joined #archiveteam-bs |
19:04
🔗
|
|
mistym has quit IRC (Ping timeout: 252 seconds) |
19:07
🔗
|
|
mistym has joined #archiveteam-bs |
19:16
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
19:17
🔗
|
|
Start has joined #archiveteam-bs |
19:17
🔗
|
|
JesseW has joined #archiveteam-bs |
19:47
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
19:51
🔗
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
19:55
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
20:07
🔗
|
|
JesseW has joined #archiveteam-bs |
20:11
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
20:13
🔗
|
|
JesseW has quit IRC (Client Quit) |
20:16
🔗
|
|
JesseW has joined #archiveteam-bs |
20:20
🔗
|
|
mistym_ has joined #archiveteam-bs |
20:26
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
20:37
🔗
|
|
rejk has joined #archiveteam-bs |
20:45
🔗
|
|
JesseW has quit IRC (Leaving.) |
20:46
🔗
|
aaaaaaaaa |
Well, looks like Microsoft has in app purchases in the default software: http://www.wired.co.uk/news/archive/2015-07/30/windows-10-paid-ad-removal-solitaire |
20:47
🔗
|
rejk |
no surprising since the xbox one dashboard is covered in ads. fkn bs |
21:22
🔗
|
|
SimpBrain has quit IRC (Read error: Connection reset by peer) |
21:22
🔗
|
|
SimpBrai1 has joined #archiveteam-bs |
21:25
🔗
|
|
SimpBrai1 has quit IRC (Read error: Connection reset by peer) |
21:27
🔗
|
|
schbirid has quit IRC (Leaving) |
21:31
🔗
|
|
SimpBrain has joined #archiveteam-bs |
21:36
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
21:38
🔗
|
|
RichardG has joined #archiveteam-bs |
21:43
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
21:52
🔗
|
|
mistym has joined #archiveteam-bs |
21:57
🔗
|
|
mistym__ has joined #archiveteam-bs |
21:57
🔗
|
|
mistym_ has quit IRC (Ping timeout: 483 seconds) |
21:59
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
22:02
🔗
|
|
mistym has joined #archiveteam-bs |
22:07
🔗
|
|
mistym__ has quit IRC (Ping timeout: 606 seconds) |
22:10
🔗
|
|
primus104 has quit IRC (Leaving.) |
22:14
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
22:27
🔗
|
|
lexicon has joined #archiveteam-bs |
22:27
🔗
|
|
lexicon is now known as lexiconda |
22:27
🔗
|
|
lexiconda is now known as lexicon |
22:40
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
22:43
🔗
|
dashcloud |
aaaaaaaaa: apparently that's not new- it's been there since windows 8, but since no one used that, no one noticed until now |
23:17
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
23:21
🔗
|
|
SimpBrain has quit IRC (Leaving) |
23:41
🔗
|
SketchCow |
Right |
23:41
🔗
|
SketchCow |
http://fffff.at/ |
23:41
🔗
|
|
mistym has joined #archiveteam-bs |
23:48
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
23:52
🔗
|
|
mistym has quit IRC (Ping timeout: 606 seconds) |