#archiveteam 2018-03-24,Sat

โ†‘back Search

Time Nickname Message
00:07 ๐Ÿ”— odemg has quit IRC (Read error: Operation timed out)
00:16 ๐Ÿ”— odemg has joined #archiveteam
01:27 ๐Ÿ”— bwn has quit IRC (Read error: Connection reset by peer)
01:38 ๐Ÿ”— Mateon1 has quit IRC (Remote host closed the connection)
01:38 ๐Ÿ”— Mateon1 has joined #archiveteam
01:45 ๐Ÿ”— bwn has joined #archiveteam
02:02 ๐Ÿ”— kitties has joined #archiveteam
02:19 ๐Ÿ”— BlueMax has joined #archiveteam
04:05 ๐Ÿ”— RichardG_ has quit IRC (Read error: Connection reset by peer)
04:06 ๐Ÿ”— RichardG has joined #archiveteam
04:17 ๐Ÿ”— qw3rty116 has joined #archiveteam
04:23 ๐Ÿ”— qw3rty115 has quit IRC (Read error: Operation timed out)
06:19 ๐Ÿ”— Pixi has quit IRC (Ping timeout: 255 seconds)
06:25 ๐Ÿ”— Pixi has joined #archiveteam
06:46 ๐Ÿ”— ndiddy has quit IRC ()
06:56 ๐Ÿ”— Fletcher has quit IRC (Read error: Operation timed out)
07:17 ๐Ÿ”— Fletcher has joined #archiveteam
07:24 ๐Ÿ”— kitties has quit IRC (Connection closed for inactivity)
08:29 ๐Ÿ”— tomaspark has quit IRC (Read error: Operation timed out)
08:29 ๐Ÿ”— tomaspark has joined #archiveteam
08:33 ๐Ÿ”— plue has quit IRC (Ping timeout: 260 seconds)
08:41 ๐Ÿ”— plue has joined #archiveteam
09:12 ๐Ÿ”— BlueMax has quit IRC (Leaving)
09:59 ๐Ÿ”— bwn has quit IRC (Read error: Operation timed out)
10:06 ๐Ÿ”— bwn has joined #archiveteam
10:54 ๐Ÿ”— db48x has quit IRC (Read error: Operation timed out)
11:23 ๐Ÿ”— db48x has joined #archiveteam
11:53 ๐Ÿ”— wp494_ has joined #archiveteam
11:53 ๐Ÿ”— bwn has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— qw3rty116 has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— Zialus has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— Mayonaise has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— phq__ has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— twigfoot has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— unlobito has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— FireFly has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— ivan has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— nwf has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— beardicus has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— SirCmpwn has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— Gfy has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— muramasa has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— JAA has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— MMovie has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— aMunster has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— PotcFdk has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— C4K3 has quit IRC (ny.us.hub ircd.choopa.net)
11:53 ๐Ÿ”— TigerbotH has quit IRC (ny.us.hub ircd.choopa.net)
11:55 ๐Ÿ”— Gfy_ has joined #archiveteam
11:56 ๐Ÿ”— wp494 has quit IRC (Ping timeout: 244 seconds)
12:10 ๐Ÿ”— bwn has joined #archiveteam
12:10 ๐Ÿ”— qw3rty116 has joined #archiveteam
12:10 ๐Ÿ”— Zialus has joined #archiveteam
12:10 ๐Ÿ”— Mayonaise has joined #archiveteam
12:10 ๐Ÿ”— phq__ has joined #archiveteam
12:10 ๐Ÿ”— twigfoot has joined #archiveteam
12:10 ๐Ÿ”— unlobito has joined #archiveteam
12:10 ๐Ÿ”— ivan has joined #archiveteam
12:10 ๐Ÿ”— nwf has joined #archiveteam
12:10 ๐Ÿ”— beardicus has joined #archiveteam
12:10 ๐Ÿ”— SirCmpwn has joined #archiveteam
12:10 ๐Ÿ”— JAA has joined #archiveteam
12:10 ๐Ÿ”— TigerbotH has joined #archiveteam
12:10 ๐Ÿ”— aMunster has joined #archiveteam
12:10 ๐Ÿ”— PotcFdk has joined #archiveteam
12:10 ๐Ÿ”— C4K3 has joined #archiveteam
12:10 ๐Ÿ”— ircd.choopa.net sets mode: +oo beardicus JAA
12:10 ๐Ÿ”— swebb sets mode: +o beardicus
12:10 ๐Ÿ”— swebb sets mode: +o JAA
12:11 ๐Ÿ”— odemg has quit IRC (Ping timeout: 268 seconds)
12:38 ๐Ÿ”— khaoohs has quit IRC (Read error: Connection reset by peer)
12:41 ๐Ÿ”— odemg has joined #archiveteam
13:37 ๐Ÿ”— Mateon1 has quit IRC (Remote host closed the connection)
13:37 ๐Ÿ”— Mateon1 has joined #archiveteam
13:37 ๐Ÿ”— Gfy_ is now known as Gfy
13:57 ๐Ÿ”— Mateon1 has quit IRC (Read error: Operation timed out)
13:57 ๐Ÿ”— Mateon1 has joined #archiveteam
16:02 ๐Ÿ”— MrDignity has quit IRC (Remote host closed the connection)
16:02 ๐Ÿ”— MrDignity has joined #archiveteam
16:12 ๐Ÿ”— indrora has quit IRC (Quit: Hฬƒอฏฬฬ‰ฬ…ฬอŒอ€ฬขฬฐฬฒอˆฬฑฬชฬฃอ…อฬผฬฆeอจฬอ†ฬ‹อคองอฅฬฟอ’อ‹ฬ„ฬฬฬ†ฬ”อ’อ‹ฬ‘อฎฬธอออ•ฬ อŽฬบอ• อŒฬ‰ฬŽอŒฬŠอ‘ฬ‚อฅฬ‡๏ฟฝ)
16:36 ๐Ÿ”— RichardG has quit IRC (Read error: Connection reset by peer)
16:38 ๐Ÿ”— RichardG has joined #archiveteam
16:45 ๐Ÿ”— MMovie has joined #archiveteam
16:52 ๐Ÿ”— muramasa has joined #archiveteam
17:07 ๐Ÿ”— atrocity has quit IRC (Read error: Operation timed out)
17:13 ๐Ÿ”— godane has quit IRC (Quit: Leaving.)
17:21 ๐Ÿ”— Martle has joined #archiveteam
17:25 ๐Ÿ”— SoniEx2 has joined #archiveteam
17:26 ๐Ÿ”— SoniEx2 has quit IRC (Client Quit)
17:35 ๐Ÿ”— Sanqui !a http://www.geocities.co.jp/SiliconValley-Sunnyvale/6160/
17:36 ๐Ÿ”— godane has joined #archiveteam
17:55 ๐Ÿ”— atrocity has joined #archiveteam
18:27 ๐Ÿ”— znak Sanqui: I saw that Poema.pl finished, good job and thanks :)
18:27 ๐Ÿ”— Sanqui znak: was a pretty quick job :)
18:33 ๐Ÿ”— Sanqui https://www.craigslist.org/about/FOSTA
18:35 ๐Ÿ”— Sanqui craiglist personals are gone (for the us version of the site)
19:04 ๐Ÿ”— wp494_ is now known as wp494
19:04 ๐Ÿ”— wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES)
19:05 ๐Ÿ”— wp494 has joined #archiveteam
19:47 ๐Ÿ”— tomaspark has quit IRC (Remote host closed the connection)
20:39 ๐Ÿ”— Evalelynn has joined #archiveteam
20:40 ๐Ÿ”— RichardG has quit IRC (Read error: Connection reset by peer)
20:40 ๐Ÿ”— Evalelynn has quit IRC (Client Quit)
20:42 ๐Ÿ”— RichardG has joined #archiveteam
22:03 ๐Ÿ”— BlueMax has joined #archiveteam
22:34 ๐Ÿ”— jschwart has quit IRC (Quit: Konversation terminated!)
22:36 ๐Ÿ”— REiN^ has joined #archiveteam
22:57 ๐Ÿ”— Asparagir has joined #archiveteam
23:03 ๐Ÿ”— bug_ has joined #archiveteam
23:07 ๐Ÿ”— bug_ I have a question. I'm interested in looking up specific parts of the fanfiction dot net archive scrape in 2012, but only for a couple portions of the archive, and the 50+ gb files on the internet archive are a bit daunting. Is there some sort of master directory that tells you which part of the site is scraped in a specific WARC dump?
23:24 ๐Ÿ”— icedice has joined #archiveteam
23:25 ๐Ÿ”— JAA bug_: There's a CDX file for each WARC, which contains a list of all entries inside the WARC. Among other things, it also contains the offset and length of each record, so you can use HTTP range requests to only download that part of the WARC as well.
23:27 ๐Ÿ”— Asparagir has quit IRC (Asparagir)
23:28 ๐Ÿ”— bug_ Ah, thank you! What would the difference between archiveteam-fanfiction-warc-01.cdx.gz and 00000001.tar.megawarc.warc.os.cdx.gz be, in that case, since they're both labeled as CDX? Would it just be the file type (sorry for the questions!)
23:29 ๐Ÿ”— JAA bug_: What's the link to the item? Also, let's move this to #archiveteam-bs (this channel is mainly for announcements).
23:36 ๐Ÿ”— icedice has quit IRC (Quit: Leaving)
23:58 ๐Ÿ”— balrog has quit IRC (Read error: Operation timed out)

irclogger-viewer