#archiveteam-bs 2015-08-01,Sat

Time Nickname Message
00:10 🔗 furrie has quit IRC (Quit: Page closed)
00:17 🔗 furrie has joined #archiveteam-bs
00:18 🔗 furrie yeah, hard work done by others should be archived
00:18 🔗 furrie too bad some of it got lost in the rapidly-changing internet
00:19 🔗 furrie i wish the archiveteam had a program that could make the lost sites reappear out of thin air on the Wayback Machine, complete.
00:23 🔗 furrie i figure we should start archiving educational stuff
00:33 🔗 furrie xmc: wow, i'm checking for random sonic stuff in the wayback machine, and it was all added recently, in the days when I added all of those sonic sites to ArchiveBot.
00:39 🔗 furrie has quit IRC (Quit: Page closed)
00:39 🔗 mistym has quit IRC (Remote host closed the connection)
00:51 🔗 GLaDOS has quit IRC (Read error: Operation timed out)
00:53 🔗 GLaDOS has joined #archiveteam-bs
00:59 🔗 GLaDOS has quit IRC (Ping timeout: 252 seconds)
01:08 🔗 yipdw so someone made a Markov chain with Spring Framework class names
01:08 🔗 yipdw should repeat that with Core Media function names
01:09 🔗 yipdw I mean this is a function I am actually reading about now: CMMetadataFormatDescriptionCreateWithMetadataFormatDescriptionAndMetadataSpecifications
01:11 🔗 yipdw that said I guess it doesn't compare to -[MTLBlitCommandEncoder copyFromTexture:sourceSlice:sourceLevel:sourceOrigin:sourceSize:toBuffer:destinationOffset:destinationBytesPerRow:destinationBytesPerImage:]
01:17 🔗 dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
01:18 🔗 dashcloud has joined #archiveteam-bs
01:18 🔗 JesseW has joined #archiveteam-bs
01:33 🔗 pikhq SketchCow: Still wanting someone to test Internet Arcade on Windows 10 & MS Edge?
01:33 🔗 pikhq MS Edge is (of course) not the browser I regularly use, but I do have a Windows 10 system right here. :)
01:38 🔗 pikhq ... Well, I believe it's moot because I currently can't access archive.org.
01:39 🔗 schbirid has quit IRC (Read error: Operation timed out)
01:44 🔗 JesseW has quit IRC (Quit: Leaving.)
01:47 🔗 dashcloud has quit IRC (Read error: Operation timed out)
01:48 🔗 JesseW has joined #archiveteam-bs
01:50 🔗 dashcloud has joined #archiveteam-bs
01:52 🔗 schbirid has joined #archiveteam-bs
02:02 🔗 JesseW What is the process between WARCs being uploaded to IA and them showing up in the Wayback Machine, anyway? Is it manual, semi-manual, or? What is the typical time scale? Is there an easy way to check?
02:03 🔗 DFJustin it's automatic and should be within a day or so
02:03 🔗 DFJustin easy way to check is http://web.archive.org/*/http://www.example.com/
02:03 🔗 yipdw has quit IRC (Remote host closed the connection)
02:05 🔗 yipdw has joined #archiveteam-bs
02:07 🔗 JesseW I meant, for a given item (containing a MegaWARC with 1000s of pages) a way to quickly make sure they have been included...
02:07 🔗 tomwsmf-a has joined #archiveteam-bs
02:08 🔗 JesseW For that matter, I'm not (yet) sure how to identify the list of pages in a megawarc item...
02:11 🔗 yipdw you can scan the generated cdx
02:12 🔗 kyan has joined #archiveteam-bs
02:12 🔗 JesseW Is archive.org down for other people? https://archive.org/details/archiveteam_zapd is unresponsive for me...
02:13 🔗 aaaaaaaaa yep, been that way for awhile
02:13 🔗 JesseW hm -- not status updates, I presume.
02:13 🔗 JesseW er, s/not/no/
02:16 🔗 kyanz`bot has joined #archiveteam-bs
02:16 🔗 yipdw works for me
02:16 🔗 tomwsmf-a has quit IRC (Ping timeout: 258 seconds)
02:17 🔗 aaaaaaaaa does for me again
02:17 🔗 dxrt back
02:17 🔗 JesseW interesting.
02:17 🔗 JesseW (works for me, also)
02:17 🔗 pikhq Yay?
02:18 🔗 JesseW is there an offsite status page for IA?
02:20 🔗 kyan JesseW, I think they sometimes tweet at https://twitter.com/@internetarchive
02:21 🔗 JesseW kyan: thanks -- checked there, no mention of the current problems
02:21 🔗 kyan yeah i didn't see one either :P
02:23 🔗 JesseW is there an "official" IA IRC channel?
02:23 🔗 aaaaaaaaa well, it is 7:20 PDT on a friday
02:23 🔗 aaaaaaaaa depending on the problem there may not be any time to tweet
02:24 🔗 kyan there's #internetarchive (EFnet)
02:24 🔗 kyan but as it says in the topic, it's unofficial
02:25 🔗 JesseW I see.
02:26 🔗 * JesseW really needs to hack my copy of TsLogBot.py to support changing the list of channels logged without a hard reboot...
02:28 🔗 JesseW So, about CDX files -- what is the .cdx.idx (as opposed to .cdx.gz)? The _files.xml file says it is a "Item CDX Meta-Index"...
02:29 🔗 primus104 has quit IRC (Leaving.)
02:29 🔗 JesseW I'm looking at https://archive.org/download/archiveteam_zapd_20131029051118
02:30 🔗 yipdw you want the megawarc CDX
02:31 🔗 yipdw e.g
02:31 🔗 yipdw curl -sL 'https://archive.org/download/archiveteam_zapd_20131016071259/zapd_20131016071259.megawarc.warc.os.cdx.gz' | gunzip -c | cut -f3 -d' '
02:31 🔗 yipdw that'll list all URLs, in URL order, in that megawarc
02:33 🔗 JesseW nice.
02:33 🔗 * JesseW goes to add that to the wiki...
02:34 🔗 yipdw that said there are better ways
02:35 🔗 yipdw Wayback, for example
02:36 🔗 yipdw the os.cdx.gz IIRC is a megawarc-specific thing
02:37 🔗 yipdw but that pipeline should work for any WARC CDX you find
02:38 🔗 yipdw the rest of the cdx record is useful as well; for example, if you want to know what WARC to look in, you can get the filename, offset, and size from the record also
02:38 🔗 yipdw in the case of megawarc you will get the original WARC name, offset, and size
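
[Editor's note] The points yipdw makes above (URLs, plus filename, offset, and size from the same record) can be sketched in Python. This is a rough illustration, not the tooling anyone in the channel used. It assumes the CDX file starts with the usual header line (for example " CDX N b a m s k r M S V g"), whose first character is the field delimiter, and it relies on the conventional CDX legend letters: a = original URL, g = file name, V = compressed offset, S = compressed record size. The function names parse_cdx and read_cdx_gz are hypothetical.

```python
import gzip


def parse_cdx(lines):
    """Yield one dict per CDX record, keyed by the header's field letters.

    Assumes the first line is a header such as " CDX N b a m s k r M S V g".
    """
    lines = iter(lines)
    header = next(lines, None)
    if not header:
        return  # empty input: nothing to yield
    header = header.rstrip("\n")
    delim = header[0]                  # first character of the header is the delimiter
    letters = header.split(delim)[2:]  # skip the empty string and the "CDX" tag
    for line in lines:
        values = line.rstrip("\n").split(delim)
        if len(values) != len(letters):
            continue                   # skip lines that do not match the header layout
        yield dict(zip(letters, values))


def read_cdx_gz(path):
    """Parse a locally downloaded .cdx.gz file (path is supplied by the caller)."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        yield from parse_cdx(handle)


# Made-up sample data in the assumed 11-field layout.
sample = [
    " CDX N b a m s k r M S V g\n",
    "com,example)/ 20131016071259 http://example.com/ text/html 200 ABCDEF - - 1234 5678 example.warc.gz\n",
]
records = list(parse_cdx(sample))
for rec in records:
    print(rec["a"], rec["g"], rec["V"], rec["S"])
```

Reading the header instead of hard-coding column 3 means the sketch still picks the right fields if a CDX file lists them in a different order.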
02:49 🔗 JesseW added to http://archiveteam.org/index.php?title=The_WARC_Ecosystem#CDX_File_Format
02:53 🔗 JesseW1 has joined #archiveteam-bs
02:55 🔗 JesseW has quit IRC (Read error: Operation timed out)
03:02 🔗 GLaDOS has joined #archiveteam-bs
03:22 🔗 GLaDOS has quit IRC (Ping timeout: 252 seconds)
03:25 🔗 GLaDOS has joined #archiveteam-bs
04:06 🔗 kyan has quit IRC (Read error: Connection reset by peer)
04:06 🔗 kyan has joined #archiveteam-bs
04:22 🔗 mistym has joined #archiveteam-bs
04:25 🔗 kyanz`bot has quit IRC (Ping timeout: 1221 seconds)
04:30 🔗 aaaaaaaaa has quit IRC (Leaving)
05:06 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
05:06 🔗 dashcloud has joined #archiveteam-bs
05:23 🔗 xtr-201 has quit IRC (Read error: Operation timed out)
05:28 🔗 BlueMaxim has quit IRC (Ping timeout: 306 seconds)
05:28 🔗 BlueMaxim has joined #archiveteam-bs
05:30 🔗 dashcloud has quit IRC (Quit: No Ping reply in 210 seconds.)
05:35 🔗 dashcloud has joined #archiveteam-bs
05:44 🔗 ewrerwt has joined #archiveteam-bs
05:44 🔗 ewrerwt is now known as skrp
05:44 🔗 skrp is now known as rejk
05:51 🔗 rejk http://imagebin.ca/v/29uYWbJ6ijEn
05:51 🔗 rejk freebsd on all three file servers: ntfs ufs zfs
05:54 🔗 rejk 100TB archive of traffic information. funny things happen in internet traffic. you get traffic like 56.2GB The Master Collection of How To Date Women.
05:54 🔗 mistym has quit IRC (Remote host closed the connection)
06:15 🔗 mistym has joined #archiveteam-bs
06:32 🔗 Sanqui is now known as Sanqui|go
06:58 🔗 mistym has quit IRC (Remote host closed the connection)
07:00 🔗 mistym has joined #archiveteam-bs
07:17 🔗 dashcloud has quit IRC (Read error: Operation timed out)
07:33 🔗 dashcloud has joined #archiveteam-bs
07:51 🔗 JesseW1 has quit IRC (Quit: Leaving.)
08:12 🔗 rejk has quit IRC (Remote host closed the connection)
08:17 🔗 mistym has quit IRC (Remote host closed the connection)
08:26 🔗 mistym has joined #archiveteam-bs
08:30 🔗 mistym has quit IRC (Remote host closed the connection)
08:31 🔗 Ctrl-S do you guys archive the various linux distros, microsoft patches/releases/etc and other things like that?
08:33 🔗 mistym has joined #archiveteam-bs
08:42 🔗 mistym has quit IRC (Remote host closed the connection)
08:51 🔗 dashcloud has quit IRC (Read error: Operation timed out)
08:58 🔗 dashcloud has joined #archiveteam-bs
09:21 🔗 primus104 has joined #archiveteam-bs
09:41 🔗 BlueMaxim has quit IRC (Quit: Leaving)
09:42 🔗 mistym has joined #archiveteam-bs
09:44 🔗 BlueMaxim has joined #archiveteam-bs
09:46 🔗 dashcloud has quit IRC (Read error: Operation timed out)
09:49 🔗 mistym has quit IRC (Read error: Operation timed out)
09:50 🔗 dashcloud has joined #archiveteam-bs
10:08 🔗 dashcloud has quit IRC (Read error: Operation timed out)
10:14 🔗 dashcloud has joined #archiveteam-bs
10:26 🔗 joepie91_ Ctrl-S: not on a regular basis, I think
10:26 🔗 joepie91_ but occasionally, yes
10:26 🔗 joepie91_ you're welcome to change that, ofc ;)
10:28 🔗 Ctrl-S it seems like the kind of thing that would be essential to keep archived somewhere
10:28 🔗 joepie91_ is now known as joepie91
10:48 🔗 BlueMaxim has quit IRC (Quit: Leaving)
12:32 🔗 vitzli has joined #archiveteam-bs
12:33 🔗 schbirid has quit IRC (Read error: Operation timed out)
12:45 🔗 schbirid has joined #archiveteam-bs
12:56 🔗 Ravenloft has quit IRC (Remote host closed the connection)
13:25 🔗 godane has joined #archiveteam-bs
13:27 🔗 godane i'm back
13:27 🔗 godane i'm on my new system
13:28 🔗 godane i found a work around to boot into my ext4 partition even when UEFI doesn't see it
13:39 🔗 tomwsmf-a has joined #archiveteam-bs
13:54 🔗 godane1 has joined #archiveteam-bs
13:55 🔗 godane has quit IRC (Read error: Operation timed out)
13:56 🔗 xtr-201 has joined #archiveteam-bs
14:09 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
14:15 🔗 xtr-201 has quit IRC (Read error: Operation timed out)
14:16 🔗 dashcloud has quit IRC (Read error: Operation timed out)
14:31 🔗 xtr-201 has joined #archiveteam-bs
14:31 🔗 dashcloud has joined #archiveteam-bs
14:33 🔗 godane1 has quit IRC (Ping timeout: 306 seconds)
14:40 🔗 tomwsmf-a has joined #archiveteam-bs
14:42 🔗 godane has joined #archiveteam-bs
14:56 🔗 kyan has quit IRC (Quit: This computer has gone to sleep)
14:57 🔗 dashcloud has quit IRC (Read error: Operation timed out)
15:03 🔗 dashcloud has joined #archiveteam-bs
15:04 🔗 primus104 has quit IRC (Leaving.)
15:09 🔗 SadDM has quit IRC (Ping timeout: 483 seconds)
15:12 🔗 SadDM has joined #archiveteam-bs
15:20 🔗 tomwsmf-a has quit IRC (Ping timeout: 258 seconds)
15:30 🔗 SimpBrain has joined #archiveteam-bs
15:36 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
15:40 🔗 dashcloud has joined #archiveteam-bs
15:56 🔗 tomwsmf-a has joined #archiveteam-bs
16:03 🔗 SimpBrain has quit IRC (Quit: Leaving)
16:04 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
16:06 🔗 SimpBrain has joined #archiveteam-bs
16:24 🔗 JesseW has joined #archiveteam-bs
16:27 🔗 JesseW has left
16:29 🔗 Kirk has joined #archiveteam-bs
16:29 🔗 raylee has joined #archiveteam-bs
16:29 🔗 wm_ has joined #archiveteam-bs
16:36 🔗 raylee is now known as Rye
16:43 🔗 mistym has joined #archiveteam-bs
16:45 🔗 vitzli has quit IRC (Quit: Leaving)
17:18 🔗 godane has quit IRC (Quit: Leaving.)
17:20 🔗 Ravenloft has joined #archiveteam-bs
17:34 🔗 aaaaaaaaa has joined #archiveteam-bs
17:36 🔗 dashcloud has quit IRC (Ping timeout: 483 seconds)
17:42 🔗 dashcloud has joined #archiveteam-bs
17:45 🔗 primus104 has joined #archiveteam-bs
18:03 🔗 aaaaaaaaa has quit IRC (Leaving)
18:03 🔗 mistym has quit IRC (Ping timeout: 252 seconds)
18:06 🔗 godane has joined #archiveteam-bs
18:06 🔗 mistym has joined #archiveteam-bs
18:07 🔗 godane looks like we could grab speedtest.net
18:08 🔗 godane it goes as far back as march 2007: http://www.speedtest.net/my-result/106000000
18:10 🔗 aaaaaaaaa has joined #archiveteam-bs
18:11 🔗 midas that's cool godane :D
18:15 🔗 godane i figure it could be useful data
18:16 🔗 godane we can then look at data by isp over time based on location
18:20 🔗 primus104 has quit IRC (Read error: Connection reset by peer)
18:26 🔗 primus104 has joined #archiveteam-bs
19:04 🔗 mistym has quit IRC (Ping timeout: 252 seconds)
19:07 🔗 mistym has joined #archiveteam-bs
19:16 🔗 Start has quit IRC (Quit: Disconnected.)
19:17 🔗 Start has joined #archiveteam-bs
19:17 🔗 JesseW has joined #archiveteam-bs
19:47 🔗 tomwsmf-a has joined #archiveteam-bs
19:51 🔗 tomwsmf-a has quit IRC (Ping timeout: 258 seconds)
19:55 🔗 JesseW has quit IRC (Quit: Leaving.)
20:07 🔗 JesseW has joined #archiveteam-bs
20:11 🔗 tomwsmf-a has joined #archiveteam-bs
20:13 🔗 JesseW has quit IRC (Client Quit)
20:16 🔗 JesseW has joined #archiveteam-bs
20:20 🔗 mistym_ has joined #archiveteam-bs
20:26 🔗 mistym has quit IRC (Read error: Operation timed out)
20:37 🔗 rejk has joined #archiveteam-bs
20:45 🔗 JesseW has quit IRC (Leaving.)
20:46 🔗 aaaaaaaaa Well, looks like Microsoft has in app purchases in the default software: http://www.wired.co.uk/news/archive/2015-07/30/windows-10-paid-ad-removal-solitaire
20:47 🔗 rejk not surprising since the xbox one dashboard is covered in ads. fkn bs
21:22 🔗 SimpBrain has quit IRC (Read error: Connection reset by peer)
21:22 🔗 SimpBrai1 has joined #archiveteam-bs
21:25 🔗 SimpBrai1 has quit IRC (Read error: Connection reset by peer)
21:27 🔗 schbirid has quit IRC (Leaving)
21:31 🔗 SimpBrain has joined #archiveteam-bs
21:36 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
21:38 🔗 RichardG has joined #archiveteam-bs
21:43 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
21:52 🔗 mistym has joined #archiveteam-bs
21:57 🔗 mistym__ has joined #archiveteam-bs
21:57 🔗 mistym_ has quit IRC (Ping timeout: 483 seconds)
21:59 🔗 mistym has quit IRC (Read error: Operation timed out)
22:02 🔗 mistym has joined #archiveteam-bs
22:07 🔗 mistym__ has quit IRC (Ping timeout: 606 seconds)
22:10 🔗 primus104 has quit IRC (Leaving.)
22:14 🔗 tomwsmf-a has joined #archiveteam-bs
22:27 🔗 lexicon has joined #archiveteam-bs
22:27 🔗 lexicon is now known as lexiconda
22:27 🔗 lexiconda is now known as lexicon
22:40 🔗 mistym has quit IRC (Remote host closed the connection)
22:43 🔗 dashcloud aaaaaaaaa: apparently that's not new- it's been there since windows 8, but since no one used that, no one noticed until now
23:17 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
23:21 🔗 SimpBrain has quit IRC (Leaving)
23:41 🔗 SketchCow Right
23:41 🔗 SketchCow http://fffff.at/
23:41 🔗 mistym has joined #archiveteam-bs
23:48 🔗 tomwsmf-a has joined #archiveteam-bs
23:52 🔗 mistym has quit IRC (Ping timeout: 606 seconds)
