Time |
Nickname |
Message |
00:06
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
00:07
🔗
|
|
MMovie has joined #archiveteam |
00:22
🔗
|
|
Stilett0 is now known as Stiletto |
00:35
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
00:37
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
00:39
🔗
|
|
dashcloud has joined #archiveteam |
00:45
🔗
|
|
redlob has joined #archiveteam |
00:53
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
00:55
🔗
|
|
MMovie has joined #archiveteam |
01:08
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
01:09
🔗
|
|
MMovie has joined #archiveteam |
01:11
🔗
|
|
JesseW has joined #archiveteam |
01:12
🔗
|
FalconK |
so FOS is still looking really congested |
01:12
🔗
|
FalconK |
like 50-100kB/s even after connections were limited to 50 concurrent |
01:12
🔗
|
FalconK |
is there some way I can configure it to assemble ~25GB megawarcs and upload more directly to IA? |
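One way to picture the "~25GB megawarc" idea is as a simple size-based batching step that runs before upload. The sketch below is a minimal illustration only: `batch_by_size` and `TARGET_BYTES` are hypothetical names, and this is not the actual logic of the archiveteam-megawarc-factory scripts.

```python
from typing import Iterable, List, Tuple

# Hypothetical target: roughly the 25 GB figure mentioned in the question.
TARGET_BYTES = 25 * 10**9


def batch_by_size(files: Iterable[Tuple[str, int]],
                  target: int = TARGET_BYTES) -> List[List[str]]:
    """Group (name, size_in_bytes) pairs into batches of at most `target` bytes.

    A single file larger than `target` still gets a batch of its own.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    current_size = 0
    for name, size in files:
        # Close the current batch if this file would push it past the target.
        if current and current_size + size > target:
            batches.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size
    if current:
        batches.append(current)
    return batches
```

Each resulting batch could then be concatenated into one large WARC and uploaded as a single file, which keeps the number of uploads small.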
01:22
🔗
|
yipdw |
yeah, you'll want the archiveteam-megawarc-factory git repo, an IA account, IA-S3 keys (https://archive.org/account/s3.php) and ideally access to the collection you want to put stuff in |
01:22
🔗
|
yipdw |
I can assist with setup of the whole smash except for the last bit, though I am booked until tomorrow |
01:23
🔗
|
yipdw |
I think there must be others who know how it works, though -- xmc chfoo ersi joepie91 maybe |
01:23
🔗
|
yipdw |
or they can be dereferenced |
01:23
🔗
|
xmc |
yes hi |
01:23
🔗
|
FalconK |
yeah xmc is to my left. |
01:23
🔗
|
|
n00b599 has joined #archiveteam |
01:23
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:23
🔗
|
yipdw |
wait, physically |
01:23
🔗
|
n00b599 |
r/ring secret word. |
01:23
🔗
|
* |
xmc waves |
01:23
🔗
|
FalconK |
I can do the software thing but the account and keys and access I obviously need help with :) |
01:24
🔗
|
FalconK |
not super urgent - tomorrow is just fine |
01:24
🔗
|
|
dashcloud has joined #archiveteam |
01:24
🔗
|
yipdw |
IA account is just any IA account, keys can be generated at that link |
01:24
🔗
|
yipdw |
the access thing, ping SketchCow |
01:24
🔗
|
* |
FalconK pings at SketchCow |
01:24
🔗
|
n00b599 |
Didn't work. |
01:25
🔗
|
n00b599 |
Oh got it |
01:25
🔗
|
xmc |
yahoosucks |
01:26
🔗
|
n00b599 |
Thank ya |
01:27
🔗
|
xmc |
sure thing |
01:27
🔗
|
xmc |
what're you up to? |
01:29
🔗
|
|
Anon has joined #archiveteam |
01:29
🔗
|
|
Anon has quit IRC (Client Quit) |
01:30
🔗
|
|
Anon has joined #archiveteam |
01:31
🔗
|
Anon |
loveisover is down. |
01:31
🔗
|
|
Anon has quit IRC (Client Quit) |
01:40
🔗
|
|
philpem has quit IRC (Ping timeout: 260 seconds) |
01:47
🔗
|
FalconK |
can I not just upload things to the archivebot collection? |
01:47
🔗
|
FalconK |
perhaps I will try that |
01:49
🔗
|
xmc |
i think that perhaps we should consider giving each pipeline operator a sub-collection inside of archivebot, or maybe just privileges to upload into it |
01:49
🔗
|
xmc |
hm |
01:49
🔗
|
xmc |
i'm not sure |
01:49
🔗
|
xmc |
there are many ways that we could do this differently |
01:53
🔗
|
yipdw |
oh |
01:53
🔗
|
yipdw |
archivebot collection |
01:53
🔗
|
yipdw |
yeah I think SketchCow's the only one who has that |
01:53
🔗
|
* |
yipdw doesn't have that access |
01:54
🔗
|
xmc |
yeah |
01:54
🔗
|
xmc |
sounds about right |
01:54
🔗
|
xmc |
we've been talking about this for a little while and i still can't think of a good reason for archivebot to depend on fos |
02:01
🔗
|
n00b599 |
Oh were you talking to me at 19:27? |
02:02
🔗
|
xmc |
it's not even 19:00, friend |
02:03
🔗
|
MrRadar |
Central time zone best time zone |
02:03
🔗
|
* |
MrRadar puts on shades |
02:04
🔗
|
n00b599 |
YEEEEEEEEEEEEEEEEEEEAAAAAAAAAAAAAAAAAH!!! |
02:05
🔗
|
n00b599 |
I'm from Miami for real, so I guess you can say that meme comes with the nativity. |
02:06
🔗
|
n00b599 |
It's arguably as good or better than Ocarina of Time in some ways. In some ways it's not. I beat it 10 years ago on GCN and don't remember any of it so I've been playing it on the 3DS and it's awesome. I've died and reset the game from the beginning twice in the Water Temple already (the first time I ran out of oxygen in a room where I couldn't figure out what to do and the second time I got killed by Stalfos) so I have |
02:06
🔗
|
xmc |
woop woop woop off-topic siren |
02:06
🔗
|
ErkDog |
Arguably, it's better to have a single collection point, so you aren't dealing with maintaining multiple systems |
02:06
🔗
|
n00b599 |
I was testing how the IRC formatted.. That's what I was typing out when you asked what I was up to. |
02:07
🔗
|
xmc |
ErkDog: well it's overloaded and slowing down archivebot |
02:07
🔗
|
n00b599 |
Do you guys 4chan? |
02:08
🔗
|
xmc |
why do you ask? |
02:08
🔗
|
n00b599 |
That's what I signed up here for. That and I'm an archivist in my own right. I'm surprised I never heard of this site. |
02:08
🔗
|
xmc |
are you ANOTHER 4chan archiver? |
02:08
🔗
|
n00b599 |
I |
02:08
🔗
|
n00b599 |
I |
02:08
🔗
|
n00b599 |
I'm looking for the data from May to October 4th. |
02:09
🔗
|
n00b599 |
Negatory. |
02:09
🔗
|
xmc |
for what |
02:09
🔗
|
n00b599 |
Just a frequent poster who had a lot of posts I wanted to reread. |
02:09
🔗
|
xmc |
beats me. we don't archive 4chan actively. |
02:09
🔗
|
n00b599 |
Historical preservation on the note as well. |
02:10
🔗
|
n00b599 |
Ah, I see. What I'm referring to is a good Samaritan that I read about on the 4chanarchive page who had a private archive and was volunteering to share it. |
02:10
🔗
|
xmc |
no, this is not an archive warez channel, we don't keep a meat-index of stuff |
02:10
🔗
|
xmc |
all I can say is http://archive.org/search.php |
02:11
🔗
|
n00b599 |
Oh that's what you're affiliated directly with? |
02:11
🔗
|
xmc |
no |
02:11
🔗
|
MrRadar |
We use the IA as a repository for our work but there is no official connection |
02:11
🔗
|
xmc |
but things wind up there |
02:11
🔗
|
n00b599 |
Gotcha |
02:12
🔗
|
MrRadar |
If you know someone with archived data to share, ask him to upload it there |
02:12
🔗
|
MrRadar |
That way it will be preserved |
02:12
🔗
|
n00b599 |
Well, if nobody's heard what I'm talking about, I think I found what I need. I'll just make a section on the discussion page. |
02:12
🔗
|
n00b599 |
*nod*nod* |
02:24
🔗
|
|
lokis has joined #archiveteam |
02:24
🔗
|
n00b599 |
o/ |
02:25
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
02:26
🔗
|
|
MMovie has joined #archiveteam |
02:27
🔗
|
ErkDog |
yeah it is XMC, lol |
02:28
🔗
|
ErkDog |
I mentioned previously that it might be a good idea to have regional rsync systems so like for instance, I could pick a target in the US |
02:28
🔗
|
ErkDog |
EU people could pick EU |
02:29
🔗
|
ErkDog |
then those targets could single threaded push stuff over to FOS so it wasn't getting its IO thrashed by 50 simultaneous rsyncs coming in |
02:29
🔗
|
ErkDog |
but instead 2 or 3 |
02:29
🔗
|
ErkDog |
and then the back end could keep up |
02:29
🔗
|
FalconK |
there is no point in using rsync for this at all |
02:29
🔗
|
FalconK |
since it looks like FOS is merely forwarding the archives as-is into IA collection archivebot |
02:29
🔗
|
FalconK |
just send them directly using the HTTP REST API |
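A minimal sketch of what such a direct upload could look like against the Internet Archive's S3-like endpoint, using the IA-S3 keys mentioned earlier. The function name `build_upload_request` and all item, file, and collection values are hypothetical, and this is not ArchiveBot's actual uploader. The request is only built here, not sent.

```python
import urllib.parse
import urllib.request

# IA's S3-like endpoint; keys come from https://archive.org/account/s3.php
S3_ENDPOINT = "https://s3.us.archive.org"


def build_upload_request(item: str, filename: str, data: bytes,
                         access_key: str, secret_key: str,
                         collection: str) -> urllib.request.Request:
    """Build (but do not send) a PUT request that uploads one file to an item."""
    url = f"{S3_ENDPOINT}/{urllib.parse.quote(item)}/{urllib.parse.quote(filename)}"
    headers = {
        # IA-S3 uses a simple "LOW access:secret" authorization scheme.
        "authorization": f"LOW {access_key}:{secret_key}",
        # Create the item on first upload if it does not exist yet.
        "x-archive-auto-make-bucket": "1",
        # Item metadata; uploading into a restricted collection needs privileges.
        "x-archive-meta-collection": collection,
        "x-archive-meta-mediatype": "web",
    }
    return urllib.request.Request(url, data=data, headers=headers, method="PUT")
```

Passing the result to `urllib.request.urlopen` would perform the upload. For multi-gigabyte WARCs, one would stream from an open file with an explicit Content-Length rather than hold the bytes in memory.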
02:30
🔗
|
xmc |
i don't think ia supports giving multiple users access to a restricted collection? |
02:30
🔗
|
FalconK |
like bouncing them through FOS is a waste of bandwidth |
02:30
🔗
|
ErkDog |
not really because archive bot is on the same network |
02:30
🔗
|
xmc |
but subcollections, one per archivebot pipeline owner, make sense |
02:30
🔗
|
ErkDog |
have you ever tracerouted FOS? |
02:30
🔗
|
FalconK |
yes, it's in IA |
02:30
🔗
|
FalconK |
that is fine |
02:30
🔗
|
FalconK |
it's clearly overloaded, and a bottleneck, and it doesn't need to be so it shouldn't be |
02:31
🔗
|
FalconK |
we're not saturating IA downstream |
02:31
🔗
|
FalconK |
or our upstream |
02:31
🔗
|
ErkDog |
yes, but that's because FOS is dealing with 50 incoming rsyncs |
02:31
🔗
|
xmc |
point of order, i would say "inbound" and "outbound" |
02:31
🔗
|
ErkDog |
instead of 2 or 3 from front end systems |
02:31
🔗
|
FalconK |
so it doesn't matter why |
02:31
🔗
|
xmc |
the bottleneck is in FOS's disk bandwidth |
02:31
🔗
|
FalconK |
there is just literally no reason to have that extra infrastructure |
02:32
🔗
|
ErkDog |
well if Archive Bot could pull the data from the other rsync servers, then sure |
02:32
🔗
|
FalconK |
why use rsync at all? |
02:32
🔗
|
ErkDog |
because rsync does CRC checking |
02:32
🔗
|
ErkDog |
HTTP and standard FTP don't |
02:33
🔗
|
FalconK |
the BER is probably the same, since FOS uploads over a network too |
02:33
🔗
|
FalconK |
also doesn't WARC have a CRC? |
02:34
🔗
|
ErkDog |
I wouldn't call a transfer inside the same building a "network" upload that's susceptible to the same potential for corruption as sending data across the open internet |
02:34
🔗
|
ErkDog |
I'm not sure if WARC does or not, but if it did, you'd have to do the CRC after transmit, and then retransmit the entire WARC file if it failed; rsync does the CRC in real time as things are transmitted |
02:34
🔗
|
ErkDog |
why do you h8 rsync? did it murder your family? |
02:36
🔗
|
xmc |
TCP has a CRC |
02:36
🔗
|
xmc |
on every packet! |
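Strictly speaking, the per-segment check in TCP is a 16-bit ones' complement checksum (the "Internet checksum" described in RFC 1071) rather than a CRC. A minimal sketch of that algorithm follows; the function name is illustrative, and a real TCP checksum also covers a pseudo-header that is omitted here.

```python
def internet_checksum(data: bytes) -> int:
    """Return the 16-bit ones' complement checksum of `data` (RFC 1071 style)."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with a zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]  # sum big-endian 16-bit words
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)  # fold carries back in
    return ~total & 0xFFFF
```

Because the sum is only 16 bits wide and ignores word order, some multi-bit errors can slip through, which is one reason application-level digests are still useful for large files.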
02:36
🔗
|
MrRadar |
And yes WARC files have checksums on each record too |
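The per-record checksums referred to here are typically stored in WARC headers such as WARC-Payload-Digest, commonly written as `sha1:` followed by the Base32 encoding of the SHA-1 digest. A minimal sketch of recomputing and checking such a value, with hypothetical helper names:

```python
import base64
import hashlib


def warc_sha1_digest(payload: bytes) -> str:
    """Format a SHA-1 digest the way WARC digest headers commonly do."""
    digest = hashlib.sha1(payload).digest()
    return "sha1:" + base64.b32encode(digest).decode("ascii")


def digest_matches(payload: bytes, header_value: str) -> bool:
    """Compare a recomputed digest with the value read from a record header."""
    return warc_sha1_digest(payload) == header_value.strip()
```

A receiver can run this kind of check per record after transfer, so a corrupted record can be detected without relying on the transport alone.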
02:36
🔗
|
xmc |
it's part of the error control mechanism of the internet at large |
02:36
🔗
|
xmc |
it works pretty well |
02:36
🔗
|
n00b599 |
I got a bad vibe. |
02:36
🔗
|
xmc |
i've never received an accidentally corrupted tcp stream |
02:37
🔗
|
ErkDog |
ohhh good point TCP does |
02:37
🔗
|
n00b599 |
What's a TCP screen? |
02:37
🔗
|
yipdw |
so |
02:37
🔗
|
yipdw |
if someone wants to write a different uploader for ArchiveBot, I am down with that |
02:37
🔗
|
ErkDog |
however, that doesn't change the fact that I've personally seen and experienced FTP and HTTP transfers using TCP which ended up corrupted and required retransmission |
02:37
🔗
|
FalconK |
I am going to, yipdw :) |
02:37
🔗
|
FalconK |
already in progress :) |
02:37
🔗
|
yipdw |
ideally, it would just require replacement of uploader.py |
02:38
🔗
|
FalconK |
yup |
02:38
🔗
|
xmc |
there. blessings from two people with @ and i'm sure SketchCow will be happy to get it off FOS |
02:38
🔗
|
|
marvinw_ is now known as ivan` |
02:38
🔗
|
yipdw |
it would however require each operator to have upload privileges into the archivebot collection, as well as a way to generate item names |
02:38
🔗
|
xmc |
is why i propose a subcollection for operators |
02:39
🔗
|
xmc |
though that would mean that whoever is in charge of archivebot have permissions to create these collections |
02:39
🔗
|
xmc |
my knowledge of IA's permissions mechanisms is, sadly, lacking |
02:39
🔗
|
yipdw |
anyway we'll get the tool first and figure out access later |
02:39
🔗
|
* |
xmc nods |
02:39
🔗
|
yipdw |
and are people seriously talking about TCP checksums and corrupted data w tf |
02:40
🔗
|
* |
yipdw alt+tab |
02:40
🔗
|
ErkDog |
lol yipdw |
02:40
🔗
|
xmc |
ALSO |
02:40
🔗
|
xmc |
having the pipeline operator upload directly actually gives us a much better audit trail |
02:40
🔗
|
xmc |
which is a thing i've been thinking about for a little while |
02:41
🔗
|
ErkDog |
Well, without changing the existing access permissions, having regional front end rsync targets which then sent single threaded to FOS would lessen its disk I/O significantly. |
02:41
🔗
|
ErkDog |
50 rsyncs run more than 50 times slower than 1, even 2 or 3 |
02:41
🔗
|
xmc |
why are you so stuck on rsync |
02:41
🔗
|
ErkDog |
well cause that's just how it works now |
02:41
🔗
|
yipdw |
so |
02:41
🔗
|
yipdw |
we've seen this behavior, yes |
02:41
🔗
|
ErkDog |
and would be easier to change the work flow |
02:41
🔗
|
xmc |
it works that way now because when we wrote it we were feeling lazy |
02:42
🔗
|
ErkDog |
than having IA give access permissions that they may or may not be willing to do |
02:42
🔗
|
xmc |
sometimes it is the correct time to redesign things |
02:42
🔗
|
xmc |
now is, apparently, the correct time |
02:42
🔗
|
MrRadar |
SketchCow can sort out any IA permissions we need. It's not an issue |
02:42
🔗
|
yipdw |
well that and if you give each operator subcollections the problem goes away, ish |
02:42
🔗
|
ErkDog |
OMG, if I could submit work loads at faster than 100K/sec I would be immensely happy :-D |
02:42
🔗
|
FalconK |
yes |
02:43
🔗
|
FalconK |
we share a common goal :D |
02:43
🔗
|
yipdw |
this isn't an invitation to shove fucking en.wikipedia.org into archivebot |
02:43
🔗
|
FalconK |
!a en.wikipedia.org |
02:43
🔗
|
FalconK |
er |
02:43
🔗
|
FalconK |
:P |
02:43
🔗
|
xmc |
you need http:// |
02:43
🔗
|
FalconK |
I think you can just download their whole SQL database anyway. no reason to crawl it. |
02:43
🔗
|
xmc |
that's what WIKITEAM is for |
02:43
🔗
|
xmc |
i think Nemo_bis is in charge of that |
02:44
🔗
|
yipdw |
come to think of it, ia upload plus shell script is about all you need |
02:44
🔗
|
yipdw |
well |
02:44
🔗
|
yipdw |
and a 64-bit system |
02:44
🔗
|
FalconK |
ha |
02:45
🔗
|
xmc |
all that and a bag of chips |
02:45
🔗
|
yipdw |
I mean curl or whatever, it doesn't matter really |
02:52
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
02:53
🔗
|
FalconK |
I' |
02:54
🔗
|
|
mismatch_ has quit IRC (Remote host closed the connection) |
02:54
🔗
|
FalconK |
once we get permissions set up I can test my thing |
02:54
🔗
|
|
mismatch_ has joined #archiveteam |
02:56
🔗
|
|
dashcloud has joined #archiveteam |
03:09
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
03:10
🔗
|
|
MMovie has joined #archiveteam |
03:11
🔗
|
|
n00b599 has quit IRC (Quit: Page closed) |
03:22
🔗
|
|
superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye) |
03:23
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
03:24
🔗
|
|
MMovie has joined #archiveteam |
03:37
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
03:37
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
03:38
🔗
|
|
MMovie has joined #archiveteam |
03:40
🔗
|
|
Stiletto has quit IRC (Read error: Connection reset by peer) |
03:41
🔗
|
|
Stolett0 has joined #archiveteam |
03:44
🔗
|
JesseW |
SketchCow: BTW, when I try to turn on sound on https://archive.org/details/msdos_Alpine_Tram_Ride_1989 it tells me: "This button only works once the emulation is running" even after the emulator appears to be running. IDK if I should email this to you, to info, both or neither. |
03:44
🔗
|
|
Stolett0 is now known as Stiletto |
03:48
🔗
|
|
Stolett0 has joined #archiveteam |
03:58
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
03:59
🔗
|
|
vitzli has joined #archiveteam |
03:59
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
04:00
🔗
|
|
Stolett0 has quit IRC (Read error: Connection reset by peer) |
04:00
🔗
|
|
MMovie has joined #archiveteam |
04:01
🔗
|
|
Stolett0 has joined #archiveteam |
04:17
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
04:19
🔗
|
|
MMovie has joined #archiveteam |
04:25
🔗
|
|
Stolett0 is now known as Stiletto |
04:26
🔗
|
|
Stiletto is now known as Stilett0 |
04:26
🔗
|
|
Stilett0 is now known as Stiletto |
04:29
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
04:44
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
04:45
🔗
|
|
MMovie has joined #archiveteam |
05:02
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
05:04
🔗
|
|
MMovie has joined #archiveteam |
05:07
🔗
|
|
RedType has joined #archiveteam |
05:07
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
05:14
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
05:14
🔗
|
|
bwn has joined #archiveteam |
05:14
🔗
|
|
metalcamp has joined #archiveteam |
05:21
🔗
|
|
Sk1d has joined #archiveteam |
05:22
🔗
|
|
metalcamp has quit IRC (Ping timeout: 258 seconds) |
05:26
🔗
|
|
vitzli has quit IRC (Leaving) |
05:26
🔗
|
|
vitzli has joined #archiveteam |
05:29
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
05:29
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
05:31
🔗
|
|
MMovie has joined #archiveteam |
05:37
🔗
|
Fletcher |
re: pipeline direct to IA, could we just replace uploader.py with the script FOS uses to process archivebot warcs and replace the collection name? |
05:38
🔗
|
xmc |
i think that's what FalconK is working on |
05:58
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
05:59
🔗
|
|
MMovie has joined #archiveteam |
06:17
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
06:18
🔗
|
|
MMovie has joined #archiveteam |
06:19
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
06:35
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
06:35
🔗
|
|
MMovie has joined #archiveteam |
06:53
🔗
|
|
Ungstein has joined #archiveteam |
06:53
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
06:54
🔗
|
|
MMovie has joined #archiveteam |
06:55
🔗
|
|
Ungstein1 has quit IRC (Ping timeout: 260 seconds) |
06:56
🔗
|
|
fie has quit IRC (Read error: Connection reset by peer) |
07:11
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
07:12
🔗
|
|
MMovie has joined #archiveteam |
07:14
🔗
|
|
roninski has joined #archiveteam |
07:14
🔗
|
|
ndizzle has joined #archiveteam |
07:17
🔗
|
SketchCow |
DOSBOX always works with sound on. It never is silent. |
07:21
🔗
|
|
metalcamp has joined #archiveteam |
07:25
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
07:25
🔗
|
roninski |
is there any chance someone's got a local copy of the fanfiction.net archive and can help me grab a particular file from it so i don't have to download the whole part? |
07:25
🔗
|
MrRadar |
Sure, which file are you looking for? |
07:26
🔗
|
roninski |
it's in part 9, lemme find the exact directory |
07:27
🔗
|
|
xXx_ndidd has quit IRC (Read error: Operation timed out) |
07:27
🔗
|
|
MMovie has joined #archiveteam |
07:28
🔗
|
roninski |
this folder: 1/18/183/u/1835454 |
07:29
🔗
|
MrRadar |
Hmm, you must be referring to a different fanfiction.net archive than the one I have |
07:29
🔗
|
roninski |
do you have the story archive one? |
07:29
🔗
|
MrRadar |
This is the one I have https://archive.org/details/FanfictionNearlyCompleteArchive |
07:30
🔗
|
roninski |
ahh yeah different one |
07:31
🔗
|
roninski |
i'm looking for this one: https://archive.org/details/archiveteam-fanfiction-warc-09 |
07:31
🔗
|
roninski |
thanks anyway :) |
07:32
🔗
|
roninski |
(unfortunately it's not a story i'm looking for it's a user profile) |
07:36
🔗
|
|
RichardG has joined #archiveteam |
07:41
🔗
|
ErkDog |
roninski which file was it you were wanting to download |
07:41
🔗
|
ErkDog |
I could torrent it for you |
07:44
🔗
|
roninski |
00000009.tar.megawarc.warc.gz from here - https://archive.org/download/archiveteam-fanfiction-warc-09, there's just a particular part of it i want (specifically the folder 1/18/183/u/1835454 within the archive) but i'm worried i'll kill my quota if i download the full file |
07:46
🔗
|
roninski |
unfortunately i'm not on unlimited but even though my quota is pretty big, considering how many people are in my household and how early in the month it is i'm not willing to risk it |
07:46
🔗
|
roninski |
considering the actual folder in the archive is probably tiny XD |
07:50
🔗
|
ErkDog |
quota on your interwebs? |
07:50
🔗
|
Fletcher |
roninski if you don't have it in ~5 hours highlight me and I'll grab it |
07:51
🔗
|
roninski |
250gb on 250gb off 5 person household |
07:51
🔗
|
roninski |
thanks Fletcher :) |
07:51
🔗
|
ErkDog |
ahhhh sux bro, where do you live? I'm downloading the main file now, but will take a few hours :( |
07:51
🔗
|
roninski |
Australia |
07:52
🔗
|
roninski |
and thanks man, really appreciate it :) |
07:53
🔗
|
|
WinterFox has joined #archiveteam |
07:53
🔗
|
roninski |
i'm moving to the US next month and should be able to get something better for my actual usage needs but until then I'm kinda stuck with terrible Aussie internet XD |
07:53
🔗
|
roninski |
where're you ErkDog? |
07:55
🔗
|
ErkDog |
US |
07:56
🔗
|
roninski |
where abouts? |
07:57
🔗
|
ErkDog |
Virginia |
07:58
🔗
|
roninski |
ah nice, i'm moving to Seattle |
08:00
🔗
|
* |
xmc waves from seattle |
08:01
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
08:01
🔗
|
|
MMovie has joined #archiveteam |
08:05
🔗
|
ErkDog |
don't forget your umbrella!@!! |
08:18
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
08:20
🔗
|
|
MMovie has joined #archiveteam |
08:22
🔗
|
|
atomotic has joined #archiveteam |
08:24
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
08:28
🔗
|
|
dashcloud has joined #archiveteam |
08:32
🔗
|
|
schbirid has joined #archiveteam |
08:39
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
08:52
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
08:53
🔗
|
|
MMovie has joined #archiveteam |
08:57
🔗
|
|
metalcamp has quit IRC (Ping timeout: 258 seconds) |
09:02
🔗
|
|
redlob has quit IRC (Quit: ZNC - http://znc.in) |
09:03
🔗
|
|
redlob has joined #archiveteam |
09:34
🔗
|
|
bwn has joined #archiveteam |
09:37
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
09:38
🔗
|
|
MMovie has joined #archiveteam |
09:38
🔗
|
|
vtyl has quit IRC (Ping timeout: 250 seconds) |
09:42
🔗
|
|
lytv has joined #archiveteam |
10:09
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
10:10
🔗
|
|
MMovie has joined #archiveteam |
10:22
🔗
|
|
jut has joined #archiveteam |
10:27
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
10:29
🔗
|
|
MMovie has joined #archiveteam |
10:47
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
10:49
🔗
|
|
MMovie has joined #archiveteam |
11:07
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
11:07
🔗
|
|
MMovie has joined #archiveteam |
11:09
🔗
|
|
metalcamp has joined #archiveteam |
11:24
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
11:26
🔗
|
|
MMovie has joined #archiveteam |
11:43
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
11:44
🔗
|
|
MMovie has joined #archiveteam |
11:45
🔗
|
|
signius has quit IRC (Read error: Operation timed out) |
11:49
🔗
|
|
signius has joined #archiveteam |
11:50
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
11:54
🔗
|
|
dashcloud has joined #archiveteam |
12:01
🔗
|
|
vOYtEC has quit IRC (Quit: rm -r *) |
12:08
🔗
|
|
[phire] has quit IRC (Quit: ZNC - http://znc.in) |
12:20
🔗
|
|
[phire] has joined #archiveteam |
12:20
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
12:22
🔗
|
|
MMovie has joined #archiveteam |
12:27
🔗
|
|
Sk2d has joined #archiveteam |
12:27
🔗
|
|
PurpleSym has quit IRC (*) |
12:27
🔗
|
|
PurpleSym has joined #archiveteam |
12:27
🔗
|
|
Sk1d has quit IRC (hub.se irc.du.se) |
12:35
🔗
|
|
atomotic has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) |
12:42
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
12:43
🔗
|
|
Sk2d is now known as Sk1d |
12:43
🔗
|
|
metalcamp has quit IRC (Read error: Connection reset by peer) |
12:44
🔗
|
|
MMovie has joined #archiveteam |
12:45
🔗
|
|
metalcamp has joined #archiveteam |
12:49
🔗
|
|
VADemon has joined #archiveteam |
12:59
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
12:59
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
13:00
🔗
|
|
MMovie has joined #archiveteam |
13:15
🔗
|
|
atomotic has joined #archiveteam |
13:35
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
13:35
🔗
|
|
MMovie has joined #archiveteam |
13:38
🔗
|
|
dserodio has quit IRC (Quit: ZNC - http://znc.in) |
13:52
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
13:54
🔗
|
|
MMovie has joined #archiveteam |
13:55
🔗
|
|
dserodio has joined #archiveteam |
13:58
🔗
|
|
brayden_ has joined #archiveteam |
13:58
🔗
|
|
swebb sets mode: +o brayden_ |
14:02
🔗
|
|
brayden has quit IRC (Read error: Operation timed out) |
14:12
🔗
|
|
pgoetz has quit IRC (Remote host closed the connection) |
14:14
🔗
|
|
pgoetz has joined #archiveteam |
14:14
🔗
|
|
pgoetz has quit IRC (Remote host closed the connection) |
14:14
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
14:15
🔗
|
|
MMovie has joined #archiveteam |
14:32
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
14:34
🔗
|
|
MMovie has joined #archiveteam |
14:43
🔗
|
|
metalcamp has quit IRC (Ping timeout: 258 seconds) |
14:44
🔗
|
HCross |
The BBCshop.com is closing |
14:44
🔗
|
HCross |
http://www.bbcshop.com/page/helpfaq |
15:00
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
15:01
🔗
|
|
MMovie has joined #archiveteam |
15:02
🔗
|
|
pgoetz has joined #archiveteam |
15:08
🔗
|
|
metalcamp has joined #archiveteam |
15:12
🔗
|
|
scyther has joined #archiveteam |
15:35
🔗
|
|
dzman has joined #archiveteam |
15:37
🔗
|
|
brayden_ is now known as brayden |
15:37
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
15:39
🔗
|
|
MMovie has joined #archiveteam |
15:42
🔗
|
dzman |
Guys, is there a(n easy) way finding a user in the hyves archive? :-) |
15:44
🔗
|
midas |
not really |
15:44
🔗
|
midas |
you can however |
15:44
🔗
|
midas |
if you have the username find him or her on the url |
15:45
🔗
|
dzman |
How :)? |
15:45
🔗
|
midas |
but there is not an easy way to search in the entire collection |
15:45
🔗
|
midas |
add the url in the wayback machine |
15:45
🔗
|
midas |
it will find it for you |
15:46
🔗
|
dzman |
What can i use in the url? |
15:47
🔗
|
dzman |
http://hyves.nl/username ? |
15:47
🔗
|
midas |
the username if you know it |
15:47
🔗
|
midas |
i think it was username.hyves.nl |
15:47
🔗
|
dzman |
ah thanks, i will try |
15:48
🔗
|
midas |
good luck :) |
15:49
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
15:51
🔗
|
|
MMovie has joined #archiveteam |
15:51
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
15:52
🔗
|
dzman |
it says Page cannot be crawled or displayed due to robots.txt. every time i use my username :/ |
16:00
🔗
|
midas |
well damn |
16:00
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
16:00
🔗
|
midas |
hyves.nl redirects to hyvesgames.nl which doesn't allow the archive.org crawler |
16:00
🔗
|
midas |
thats shitty. |
16:01
🔗
|
midas |
joepie91: did you notice this yet? |
16:01
🔗
|
|
MMovie has joined #archiveteam |
16:04
🔗
|
|
dzman has quit IRC (Ping timeout: 255 seconds) |
16:18
🔗
|
joepie91 |
I did not |
16:20
🔗
|
PurpleSym |
I downloaded the CDX files for hyves. |
16:20
🔗
|
PurpleSym |
And could grep them. |
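A minimal sketch of that kind of search over CDX index lines. It assumes the common space-separated CDX layout in which the second field is the capture timestamp and the third is the original URL; the function name and the sample data in the usage note are hypothetical.

```python
from typing import Iterable, Iterator, Tuple


def find_user_captures(cdx_lines: Iterable[str],
                       username: str) -> Iterator[Tuple[str, str]]:
    """Yield (timestamp, url) for CDX lines whose original URL mentions username."""
    needle = username.lower()
    for line in cdx_lines:
        if line.lstrip().startswith("CDX"):
            continue  # skip the field-legend header line
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        timestamp, url = fields[1], fields[2]  # assumed field positions
        if needle in url.lower():
            yield timestamp, url
```

For example, iterating over an open CDX file with `find_user_captures(f, "someuser")` would list captures of `someuser.hyves.nl` along with timestamps that can be used to locate the record.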
16:20
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
16:21
🔗
|
|
MMovie has joined #archiveteam |
16:24
🔗
|
|
metalcamp has quit IRC (Ping timeout: 258 seconds) |
16:38
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
16:40
🔗
|
|
MMovie has joined #archiveteam |
16:51
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
16:54
🔗
|
|
atomotic has joined #archiveteam |
16:56
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
16:57
🔗
|
|
MMovie has joined #archiveteam |
16:59
🔗
|
|
JesseW has joined #archiveteam |
17:13
🔗
|
|
scyther has quit IRC (Quit: Leaving) |
17:13
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
17:14
🔗
|
|
MMovie has joined #archiveteam |
17:22
🔗
|
|
JesseW has quit IRC (Quit: Leaving.) |
17:31
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
17:32
🔗
|
|
MMovie has joined #archiveteam |
17:33
🔗
|
|
xXx_ndidd has joined #archiveteam |
17:38
🔗
|
|
bwn has joined #archiveteam |
17:46
🔗
|
|
ndizzle has quit IRC (Read error: Operation timed out) |
17:47
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
17:48
🔗
|
|
MMovie has joined #archiveteam |
18:06
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
18:07
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
18:08
🔗
|
|
MMovie has joined #archiveteam |
18:13
🔗
|
|
vitzli has quit IRC (Leaving) |
18:16
🔗
|
|
ndizzle has joined #archiveteam |
18:29
🔗
|
|
xXx_ndidd has quit IRC (Read error: Operation timed out) |
18:35
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
18:36
🔗
|
|
MMovie has joined #archiveteam |
18:45
🔗
|
|
HCross has quit IRC (Read error: Connection reset by peer) |
18:49
🔗
|
|
HCross has joined #archiveteam |
19:05
🔗
|
|
yipdw has quit IRC (Ping timeout: 1224 seconds) |
19:05
🔗
|
|
signius has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
FalconK has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
hive-mind has quit IRC (Ping timeout: 316 seconds) |
19:05
🔗
|
|
SirCmpwn has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
bauruine has quit IRC (Ping timeout: 316 seconds) |
19:05
🔗
|
|
ploopkaz- has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
Atluxity has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
balrog has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
dan- has quit IRC (Ping timeout: 345 seconds) |
19:05
🔗
|
|
HCross2 has quit IRC (Read error: Connection reset by peer) |
19:05
🔗
|
|
johtso has quit IRC (Read error: Connection reset by peer) |
19:05
🔗
|
|
zhongfu has quit IRC (Remote host closed the connection) |
19:05
🔗
|
|
wp494_ has joined #archiveteam |
19:05
🔗
|
|
victor has quit IRC (Write error: Broken pipe) |
19:05
🔗
|
|
d_rebel has quit IRC (Write error: Connection reset by peer) |
19:05
🔗
|
|
Vito` has quit IRC (Write error: Connection reset by peer) |
19:05
🔗
|
|
winr4r has quit IRC (Write error: Connection reset by peer) |
19:05
🔗
|
|
victor has joined #archiveteam |
19:05
🔗
|
|
Vito` has joined #archiveteam |
19:05
🔗
|
|
_desu____ has joined #archiveteam |
19:05
🔗
|
|
hive-mind has joined #archiveteam |
19:05
🔗
|
|
bauruine has joined #archiveteam |
19:06
🔗
|
|
d_rebel has joined #archiveteam |
19:06
🔗
|
|
Boltsie_ has joined #archiveteam |
19:06
🔗
|
|
balrog has joined #archiveteam |
19:06
🔗
|
|
swebb sets mode: +o balrog |
19:06
🔗
|
|
ploopkazo has joined #archiveteam |
19:06
🔗
|
|
deathy_ has joined #archiveteam |
19:06
🔗
|
|
Atluxity has joined #archiveteam |
19:06
🔗
|
|
TheKiwi_ has joined #archiveteam |
19:06
🔗
|
|
Ungstein1 has joined #archiveteam |
19:06
🔗
|
|
FalconK has joined #archiveteam |
19:06
🔗
|
|
beeper_ has joined #archiveteam |
19:07
🔗
|
|
casdr_ has joined #archiveteam |
19:07
🔗
|
|
casdr_ has quit IRC (Connection closed) |
19:07
🔗
|
|
beeper_ has quit IRC (Connection closed) |
19:07
🔗
|
|
TheKiwi_ has quit IRC (Connection closed) |
19:07
🔗
|
|
kevin_ has joined #archiveteam |
19:07
🔗
|
|
casdr_ has joined #archiveteam |
19:07
🔗
|
|
TheKiwi_ has joined #archiveteam |
19:07
🔗
|
|
beeper_ has joined #archiveteam |
19:07
🔗
|
|
zhongfu has joined #archiveteam |
19:08
🔗
|
|
Ungstein has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
wp494 has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
_desu___ has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
Boltsie has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
JSharp___ has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
TheKiwi has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
Ctrl-S___ has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
beeper has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
kevin has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
casdr has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
abartov__ has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
karissa__ has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
VonGuard has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
deathy has quit IRC (Ping timeout: 274 seconds) |
19:08
🔗
|
|
_desu____ is now known as _desu___ |
19:08
🔗
|
|
Boltsie_ is now known as Boltsie |
19:08
🔗
|
|
deathy_ is now known as deathy |
19:08
🔗
|
|
SirCmpwn has joined #archiveteam |
19:08
🔗
|
|
casdr_ is now known as casdr |
19:08
🔗
|
|
beeper_ is now known as beeper |
19:08
🔗
|
|
TheKiwi_ is now known as TheKiwi |
19:08
🔗
|
|
kevin_ is now known as kevin |
19:08
🔗
|
|
abartov__ has joined #archiveteam |
19:08
🔗
|
|
dan- has joined #archiveteam |
19:08
🔗
|
|
HCross2 has joined #archiveteam |
19:09
🔗
|
|
beeper has quit IRC (Remote host closed the connection) |
19:09
🔗
|
|
beeper has joined #archiveteam |
19:10
🔗
|
|
johtso has joined #archiveteam |
19:10
🔗
|
|
signius has joined #archiveteam |
19:10
🔗
|
|
TheKiwi has quit IRC (Remote host closed the connection) |
19:11
🔗
|
|
TheKiwi has joined #archiveteam |
19:12
🔗
|
|
winr4r has joined #archiveteam |
19:23
🔗
|
|
jut has quit IRC (jut) |
19:30
🔗
|
|
Tomcat_ has joined #archiveteam |
19:33
🔗
|
|
Tomcat__ has joined #archiveteam |
19:35
🔗
|
|
Tomcat_ has quit IRC (Read error: Operation timed out) |
19:56
🔗
|
FalconK |
Fletcher: code should drop this week |
20:05
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
20:05
🔗
|
|
Tomcat__ has quit IRC (Remote host closed the connection) |
20:06
🔗
|
|
MMovie has joined #archiveteam |
20:22
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
20:24
🔗
|
|
MMovie has joined #archiveteam |
20:29
🔗
|
|
ndiddy has joined #archiveteam |
20:30
🔗
|
|
ndizzle has quit IRC (Read error: Operation timed out) |
20:39
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
20:42
🔗
|
|
dashcloud has joined #archiveteam |
20:49
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
20:51
🔗
|
|
MMovie has joined #archiveteam |
21:01
🔗
|
|
metalcamp has joined #archiveteam |
21:06
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
21:08
🔗
|
|
MMovie has joined #archiveteam |
21:20
🔗
|
|
jake1 has joined #archiveteam |
21:21
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
21:23
🔗
|
|
MMovie has joined #archiveteam |
21:24
🔗
|
jake1 |
MrRadar: SketchCow passed along the OverflowError bug you found in the ia CLI when uploading files larger than ~2GB. This should fix it here: https://github.com/jjjake/internetarchive/commit/6c9f77cb9b57296bc88278b5716c7a3bc32c3b43 |
21:25
🔗
|
jake1 |
That fix will be in v1.0.2, which I hope to release later today. |
21:33
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
21:34
🔗
|
* |
JW_work waves to jake1 :-) |
21:46
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
21:48
🔗
|
|
MMovie has joined #archiveteam |
21:56
🔗
|
|
fie has joined #archiveteam |
22:01
🔗
|
dxrt |
Thanks jake1! |
22:07
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
22:08
🔗
|
MrRadar |
jake1: Thanks for the fix. |
22:09
🔗
|
|
metalcamp has quit IRC (Ping timeout: 258 seconds) |
22:11
🔗
|
|
dashcloud has joined #archiveteam |
22:24
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
22:25
🔗
|
|
MMovie has joined #archiveteam |
22:42
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
22:43
🔗
|
|
MMovie has joined #archiveteam |
22:47
🔗
|
ErkDog |
OK so I told roninski I would help him get a folder out of a web archive, so I downloaded it and extracted it, so now I have this 138 Gig warc file that I don't know what to do with :( |
22:49
🔗
|
MrRadar |
You can use warcat to extract it |
22:49
🔗
|
MrRadar |
https://pypi.python.org/pypi/Warcat/ |
22:49
🔗
|
MrRadar |
There are other tools as well |
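(Editor's sketch, not part of the conversation: for a rough idea of what tools like warcat walk through, a WARC file is a sequence of records, each with a `WARC/x.y` version line, header fields, a blank line, and a body whose size is given by `Content-Length`. The function names `iter_warc_headers` and `open_warc` below are hypothetical, and this is a minimal stdlib-only illustration that assumes well-formed records without folded header lines. It is not how warcat itself is implemented.)

```python
import gzip
import io


def iter_warc_headers(stream):
    """Yield a dict of header fields for each record in a binary WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of file
        if not line.strip():
            continue  # blank separator lines between records
        if not line.startswith(b"WARC/"):
            raise ValueError("unexpected line: %r" % line[:40])
        headers = {}
        while True:
            field = stream.readline()
            if field in (b"\r\n", b"\n", b""):
                break  # blank line ends the header section
            name, _, value = field.decode("utf-8", "replace").partition(":")
            headers[name.strip()] = value.strip()
        # Skip the record body in chunks so a very large file is never
        # loaded into memory at once.
        remaining = int(headers.get("Content-Length", 0))
        while remaining > 0:
            chunk = stream.read(min(remaining, 1 << 20))
            if not chunk:
                break
            remaining -= len(chunk)
        yield headers


def open_warc(path):
    """Open a WARC file, transparently handling gzip-compressed .gz files."""
    return gzip.open(path, "rb") if path.endswith(".gz") else open(path, "rb")


if __name__ == "__main__":
    # Tiny in-memory demo with a made-up record (example.com is a placeholder).
    sample = (
        b"WARC/1.0\r\n"
        b"WARC-Type: resource\r\n"
        b"WARC-Target-URI: http://example.com/\r\n"
        b"Content-Length: 5\r\n"
        b"\r\n"
        b"hello"
        b"\r\n\r\n"
    )
    for record in iter_warc_headers(io.BytesIO(sample)):
        print(record["WARC-Type"], record.get("WARC-Target-URI"))
```

Listing the `WARC-Target-URI` values this way is one way to see which URLs a large archive contains before deciding what to extract.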
22:49
🔗
|
ErkDog |
that's pythony, I'm in windows :( |
22:51
🔗
|
|
yipdw has joined #archiveteam |
22:52
🔗
|
ErkDog |
this looks promising: https://github.com/ikreymer/webarchiveplayer |
22:52
🔗
|
MrRadar |
Well, that will let you browse the web archive (like the IA's Wayback Machine) |
22:52
🔗
|
MrRadar |
You can install Python on Windows |
22:54
🔗
|
|
wp494_ is now known as wp494 |
22:55
🔗
|
ErkDog |
ohhhh yeah I guess I can |
22:56
🔗
|
ErkDog |
lol it would be cool if I could run the scripts from windows command line instead of losing all these resources to the virtual box hypervisor, lol |
22:57
🔗
|
JW_work |
ErkDog: and actually, there's an instance of (a variant of) webarchiveplayer running at http://archivelab.org:3579/item/{IA identifier} so if you put the archive identifier you downloaded in at the end, then /*/ then the website, you should be able to get it without even downloading it. |
22:59
🔗
|
|
JW_work has left |
22:59
🔗
|
|
JW_work has joined #archiveteam |
22:59
🔗
|
ErkDog |
OOOO'rly thanks JW_work |
22:59
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
22:59
🔗
|
yipdw |
so, webarchiveplayer is also bundled as a Windows application |
23:01
🔗
|
|
MMovie has joined #archiveteam |
23:01
🔗
|
ErkDog |
what constitutes the "IA Identifier" of: https://archive.org/download/archiveteam-fanfiction-warc-09 |
23:01
🔗
|
|
RedType has left |
23:02
🔗
|
MrRadar |
archiveteam-fanfiction-warc-09 |
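(Editor's sketch, not part of the conversation: putting JW_work's description together with this identifier, the viewer URL would be the base address, the identifier, `/*/`, and then the website. The helper name `build_archivelab_url` and the example site URL below are assumptions for illustration only. Whether the archivelab.org instance is still running is not something this log can confirm.)

```python
def build_archivelab_url(identifier, site_url):
    """Assemble a viewer URL following the pattern described in the chat:
    http://archivelab.org:3579/item/{IA identifier}/*/{website}
    """
    base = "http://archivelab.org:3579/item/"
    return "{}{}/*/{}".format(base, identifier, site_url)


if __name__ == "__main__":
    # The site URL is a hypothetical placeholder; substitute the site
    # that was actually archived in the item.
    print(build_archivelab_url("archiveteam-fanfiction-warc-09",
                               "http://www.example.com/"))
```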
23:17
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
23:19
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
23:21
🔗
|
|
MMovie has joined #archiveteam |
23:38
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
23:40
🔗
|
|
MMovie has joined #archiveteam |
23:56
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
23:57
🔗
|
|
MMovie has joined #archiveteam |