#archiveteam 2018-03-24,Sat

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***odemg has quit IRC (Read error: Operation timed out) [00:07]
odemg has joined #archiveteam [00:16]
............... (idle for 1h11mn)
bwn has quit IRC (Read error: Connection reset by peer) [01:27]
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam
[01:38]
bwn has joined #archiveteam [01:45]
.... (idle for 17mn)
kitties has joined #archiveteam [02:02]
.... (idle for 17mn)
BlueMax has joined #archiveteam [02:19]
...................... (idle for 1h46mn)
RichardG_ has quit IRC (Read error: Connection reset by peer)
RichardG has joined #archiveteam
[04:05]
qw3rty116 has joined #archiveteam [04:17]
qw3rty115 has quit IRC (Read error: Operation timed out) [04:23]
........................ (idle for 1h56mn)
Pixi has quit IRC (Ping timeout: 255 seconds) [06:19]
Pixi has joined #archiveteam [06:25]
..... (idle for 21mn)
ndiddy has quit IRC () [06:46]
Fletcher has quit IRC (Read error: Operation timed out) [06:56]
..... (idle for 21mn)
Fletcher has joined #archiveteam [07:17]
kitties has quit IRC (Connection closed for inactivity) [07:24]
.............. (idle for 1h5mn)
tomaspark has quit IRC (Read error: Operation timed out)
tomaspark has joined #archiveteam
plue has quit IRC (Ping timeout: 260 seconds)
[08:29]
plue has joined #archiveteam [08:41]
....... (idle for 31mn)
BlueMax has quit IRC (Leaving) [09:12]
.......... (idle for 47mn)
bwn has quit IRC (Read error: Operation timed out) [09:59]
bwn has joined #archiveteam [10:06]
.......... (idle for 48mn)
db48x has quit IRC (Read error: Operation timed out) [10:54]
...... (idle for 29mn)
db48x has joined #archiveteam [11:23]
....... (idle for 30mn)
wp494_ has joined #archiveteam
bwn has quit IRC (ny.us.hub ircd.choopa.net)
qw3rty116 has quit IRC (ny.us.hub ircd.choopa.net)
Zialus has quit IRC (ny.us.hub ircd.choopa.net)
Mayonaise has quit IRC (ny.us.hub ircd.choopa.net)
phq__ has quit IRC (ny.us.hub ircd.choopa.net)
twigfoot has quit IRC (ny.us.hub ircd.choopa.net)
unlobito has quit IRC (ny.us.hub ircd.choopa.net)
FireFly has quit IRC (ny.us.hub ircd.choopa.net)
ivan has quit IRC (ny.us.hub ircd.choopa.net)
nwf has quit IRC (ny.us.hub ircd.choopa.net)
beardicus has quit IRC (ny.us.hub ircd.choopa.net)
SirCmpwn has quit IRC (ny.us.hub ircd.choopa.net)
Gfy has quit IRC (ny.us.hub ircd.choopa.net)
muramasa has quit IRC (ny.us.hub ircd.choopa.net)
JAA has quit IRC (ny.us.hub ircd.choopa.net)
MMovie has quit IRC (ny.us.hub ircd.choopa.net)
aMunster has quit IRC (ny.us.hub ircd.choopa.net)
PotcFdk has quit IRC (ny.us.hub ircd.choopa.net)
C4K3 has quit IRC (ny.us.hub ircd.choopa.net)
TigerbotH has quit IRC (ny.us.hub ircd.choopa.net)
Gfy_ has joined #archiveteam
wp494 has quit IRC (Ping timeout: 244 seconds)
[11:53]
bwn has joined #archiveteam
qw3rty116 has joined #archiveteam
Zialus has joined #archiveteam
Mayonaise has joined #archiveteam
phq__ has joined #archiveteam
twigfoot has joined #archiveteam
unlobito has joined #archiveteam
ivan has joined #archiveteam
nwf has joined #archiveteam
beardicus has joined #archiveteam
SirCmpwn has joined #archiveteam
JAA has joined #archiveteam
TigerbotH has joined #archiveteam
aMunster has joined #archiveteam
PotcFdk has joined #archiveteam
C4K3 has joined #archiveteam
ircd.choopa.net sets mode: +oo beardicus JAA
swebb sets mode: +o beardicus
swebb sets mode: +o JAA
odemg has quit IRC (Ping timeout: 268 seconds)
[12:10]
...... (idle for 27mn)
khaoohs has quit IRC (Read error: Connection reset by peer)
odemg has joined #archiveteam
[12:38]
............ (idle for 56mn)
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam
Gfy_ is now known as Gfy
[13:37]
..... (idle for 20mn)
Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam
[13:57]
.......................... (idle for 2h5mn)
MrDignity has quit IRC (Remote host closed the connection)
MrDignity has joined #archiveteam
[16:02]
indrora has quit IRC (Quit: H̢̰̲͈̱̪̣͍̼̦̃ͯ́̉̅̐͌̀ͅe̸ͨ̐͆̋ͤͧͥ̿͒͋̄̐̏̆̔͒͋̑ͮ͝͏͕̠͎̺͕ ͌̉̎͌̊͑̂ͥ̇) [16:12]
..... (idle for 24mn)
RichardG has quit IRC (Read error: Connection reset by peer)
RichardG has joined #archiveteam
[16:36]
MMovie has joined #archiveteam [16:45]
muramasa has joined #archiveteam [16:52]
.... (idle for 15mn)
atrocity has quit IRC (Read error: Operation timed out) [17:07]
godane has quit IRC (Quit: Leaving.) [17:13]
Martle has joined #archiveteam
SoniEx2 has joined #archiveteam
SoniEx2 has quit IRC (Client Quit)
[17:21]
Sanqui!a http://www.geocities.co.jp/SiliconValley-Sunnyvale/6160/ [17:35]
***godane has joined #archiveteam [17:36]
.... (idle for 19mn)
atrocity has joined #archiveteam [17:55]
....... (idle for 32mn)
znakSanqui: I saw that Poema.pl finished, good job and thanks :) [18:27]
Sanquiznak: was a pretty quick job :) [18:27]
https://www.craigslist.org/about/FOSTA
craiglist personals are gone (for the us version of the site)
[18:33]
...... (idle for 29mn)
***wp494_ is now known as wp494
wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES)
wp494 has joined #archiveteam
[19:04]
......... (idle for 42mn)
tomaspark has quit IRC (Remote host closed the connection) [19:47]
........... (idle for 52mn)
Evalelynn has joined #archiveteam
RichardG has quit IRC (Read error: Connection reset by peer)
Evalelynn has quit IRC (Client Quit)
RichardG has joined #archiveteam
[20:39]
................. (idle for 1h21mn)
BlueMax has joined #archiveteam [22:03]
....... (idle for 31mn)
jschwart has quit IRC (Quit: Konversation terminated!)
REiN^ has joined #archiveteam
[22:34]
..... (idle for 21mn)
Asparagir has joined #archiveteam [22:57]
bug_ has joined #archiveteam [23:03]
bug_I have a question. I'm interested in looking up specific parts of the fanfiction dot net archive scrape in 2012, but only for a couple portions of the archive, and the 50+ gb files on the internet archive are a bit daunting. Is there some sort of master directory that tells you which part of the site is scraped in a specific WARC dump? [23:07]
.... (idle for 17mn)
***icedice has joined #archiveteam [23:24]
JAAbug_: There's a CDX file for each WARC, which contains a list of all entries inside the WARC. Among other things, it also contains the offset and length of each record, so you can use HTTP range requests to only download that part of the WARC as well. [23:25]
***Asparagir has quit IRC (Asparagir) [23:27]
bug_Ah, thank you! What would the difference between archiveteam-fanfiction-warc-01.cdx.gz and 00000001.tar.megawarc.warc.os.cdx.gz be, in that case, since they're both labeled as CDX? Would it just be the file type (sorry for the questions!) [23:28]
JAAbug_: What's the link to the item? Also, let's move this to #archiveteam-bs (this channel is mainly for announcements). [23:29]
***icedice has quit IRC (Quit: Leaving) [23:36]
..... (idle for 22mn)
balrog has quit IRC (Read error: Operation timed out) [23:58]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)