Time |
Nickname |
Message |
00:05
🔗
|
|
dtm has quit IRC (Read error: Operation timed out) |
00:11
🔗
|
|
ete_ has quit IRC (Read error: Connection reset by peer) |
00:11
🔗
|
|
cechk01 has quit IRC (Read error: Connection reset by peer) |
00:12
🔗
|
|
dtm has joined #archiveteam |
00:59
🔗
|
|
sep332 has quit IRC (Read error: Operation timed out) |
01:00
🔗
|
|
sep332 has joined #archiveteam |
01:02
🔗
|
|
bwn has quit IRC (Read error: Connection reset by peer) |
01:03
🔗
|
|
bwn has joined #archiveteam |
01:05
🔗
|
phuzion |
tree3: Any luck with downloading the videos by chance? |
01:10
🔗
|
kyan |
phuzion: For what it's worth, I've gotten most of them |
01:10
🔗
|
phuzion |
kyan: How many did you grab? |
01:11
🔗
|
kyan |
not sure yet |
01:11
🔗
|
kyan |
Thing is, some of them did'nt download fully |
01:11
🔗
|
kyan |
(left .part files in the directory) |
01:11
🔗
|
phuzion |
What are you using to download? youtube-dl? |
01:11
🔗
|
kyan |
yup |
01:11
🔗
|
phuzion |
Getting around the throttle somehow? I'm getting 500KB/s max |
01:11
🔗
|
kyan |
and some of them weren't available due to takedown requests and stuff |
01:12
🔗
|
kyan |
Yeah, youtube-dl does a rate-bypass thng |
01:12
🔗
|
kyan |
I got between 5 and 35 mbps |
01:12
🔗
|
kyan |
You can see what I got at https://archive.org/search.php?query=subject%3A%22WARCdealer%20pack%22%20AND%20subject%3A%22rochu%22 |
01:13
🔗
|
kyan |
not all of tem are uploaded yet though |
01:13
🔗
|
phuzion |
gotcha |
01:27
🔗
|
|
ParkerR has quit IRC (Remote host closed the connection) |
01:42
🔗
|
|
philpem has quit IRC (Ping timeout: 252 seconds) |
01:43
🔗
|
|
bwn_ has joined #archiveteam |
01:46
🔗
|
|
xXx_ndidd has joined #archiveteam |
01:47
🔗
|
|
K4k_ has quit IRC (Read error: Operation timed out) |
01:47
🔗
|
|
Ghost_of_ has quit IRC (Remote host closed the connection) |
01:47
🔗
|
|
K4k_ has joined #archiveteam |
01:49
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
01:53
🔗
|
|
ndiddy has quit IRC (Read error: Operation timed out) |
01:58
🔗
|
kyan |
phuzion: You can see the ones that I haven't gotten yet at http://paste.ubuntu.com/13913492/ |
01:58
🔗
|
kyan |
search for ".part" |
01:59
🔗
|
|
JesseW has joined #archiveteam |
02:26
🔗
|
|
Start has joined #archiveteam |
02:42
🔗
|
|
ParkerR has joined #archiveteam |
03:10
🔗
|
|
cechk01 has joined #archiveteam |
03:17
🔗
|
|
RichardG has quit IRC (Ping timeout: 252 seconds) |
03:25
🔗
|
|
Ungstein1 has quit IRC (Quit: Leaving.) |
03:38
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
03:40
🔗
|
|
bwn_ has quit IRC (Read error: Operation timed out) |
03:40
🔗
|
|
Start has joined #archiveteam |
03:48
🔗
|
|
WinterFox has joined #archiveteam |
03:52
🔗
|
|
balrog has quit IRC (Bye) |
03:54
🔗
|
|
balrog has joined #archiveteam |
03:54
🔗
|
|
swebb sets mode: +o balrog |
03:54
🔗
|
|
JetBalsa has joined #archiveteam |
03:55
🔗
|
JetBalsa |
I have a interesting question, I want to run a warrior but not in a VM but on shell on a existing system |
03:55
🔗
|
JetBalsa |
whats the current codebase at, I found seesaw kit and warrior and I'm confused on a current warrior in use. |
03:59
🔗
|
dashcloud |
running the warrior outside of a VM is sort of discouraged, because the consistency provided by the image is no longer there- usually you would just run the script for whatever project you are interested in |
04:03
🔗
|
aaaaaaaaa |
each project has general instructions for running without a warrior, as well as distribution specific instructions |
04:04
🔗
|
phuzion |
Yeah, it's totally doable. Lots of us do it. It's just something that requires a little bit more knowledge. |
04:05
🔗
|
phuzion |
Getting people to run the warrior is easy because it's "install virtualbox, download this file, and do file > import > click every next button you see, right click the VM and start it" |
04:08
🔗
|
JetBalsa |
I kinda like the set and forget aspect of the auto side of things :3 |
04:08
🔗
|
phuzion |
Which is what the warrior vm is great for. |
04:09
🔗
|
JetBalsa |
Ya, |
04:09
🔗
|
JetBalsa |
I wonder if I can cross load into qemu, trying now :3 |
04:09
🔗
|
phuzion |
We should either continue this conversation in #archiveteam-bs or #warrior |
04:10
🔗
|
JetBalsa |
ill move warrior |
04:37
🔗
|
SketchCow |
https://archive.org/details/macaddict&tab=collection |
05:04
🔗
|
|
JetBalsa has quit IRC (Quit: Page closed) |
05:05
🔗
|
godane |
SketchCow: cool |
05:06
🔗
|
godane |
i'm grabbing issue one of macaddict right now |
05:06
🔗
|
godane |
is wired magazine going to be put up too? |
05:07
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
05:11
🔗
|
|
indrora has joined #archiveteam |
05:12
🔗
|
SketchCow |
Maybe. |
05:13
🔗
|
|
indrora has left |
05:14
🔗
|
|
indrora has joined #archiveteam |
05:15
🔗
|
indrora |
So, a site that I vaguely believe should be archived is ska-dead due to problems. Was wondering if there's anything I can do to make archive team and FurNation (think pre-geocities for furries, started ~1996) get together? |
05:31
🔗
|
JesseW |
indrora: What's the URL? About how many pages is it? |
05:33
🔗
|
indrora |
JesseW, http://furnation.com/ and probably somewhere in the order of 10ish TB -- 20 years of furries. |
05:33
🔗
|
indrora |
What I know from the twitter is that their primary servers had 128GB of RAM and several 2TB disks |
05:36
🔗
|
|
voltagex has quit IRC (Quit: WeeChat 1.3) |
05:37
🔗
|
|
JetBalsa has joined #archiveteam |
05:40
🔗
|
indrora |
I have no idea the actual /scale/. I know that if the maintainer can be contacted, it'd be relatively simple to do what was done with pomf.se |
05:40
🔗
|
JesseW |
What did you mean by "ska-dead"? (not a term I've heard) |
05:43
🔗
|
|
nertzy has joined #archiveteam |
05:45
🔗
|
indrora |
It's dead. So dead if it were any deadder it'd be pushing up daisies. |
05:45
🔗
|
indrora |
There's no read-only version. There's no access other than broken archive.org content. |
05:48
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
05:49
🔗
|
JesseW |
ah. |
05:51
🔗
|
|
xXx_ndidd has quit IRC (Read error: Connection reset by peer) |
05:54
🔗
|
indrora |
Much of the site was powered by a lot of custom PHP. |
05:56
🔗
|
JesseW |
Do you have any means of contacting the admin? |
05:58
🔗
|
|
Sk1d has joined #archiveteam |
05:58
🔗
|
indrora |
The most I know is via Twitter ( @furnation ) -- I've idly mentioned textfiles and Archive Team, but I'm personally not aware of any direct way to contact them. |
06:06
🔗
|
indrora |
I'll see what I can do to get in contact with them. |
06:17
🔗
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
06:19
🔗
|
JesseW |
Yes, point them at textfiles/SketchCow/Jason Scott. |
06:21
🔗
|
indrora |
Will do. |
06:27
🔗
|
|
VonGuard has quit IRC (Read error: Connection reset by peer) |
06:27
🔗
|
|
VonGuard has joined #archiveteam |
07:13
🔗
|
JetBalsa |
Are there any plans to archive old reddit posts? |
07:14
🔗
|
ivan` |
JetBalsa: https://archive.org/details/2015_reddit_comments_corpus |
07:15
🔗
|
JetBalsa |
Approximately 350,000 comments out of ~1.65 billion were unavailable |
07:16
🔗
|
JetBalsa |
Also, I was thinking of entire threads in context |
07:18
🔗
|
|
remsen2 has joined #archiveteam |
07:18
🔗
|
ivan` |
JetBalsa: the dataset there doesn't have a 'parent' for the comments? |
07:18
🔗
|
JetBalsa |
looks like thats a smaller dataset |
07:18
🔗
|
JetBalsa |
others exists: https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/ |
07:18
🔗
|
|
R5M has joined #archiveteam |
07:19
🔗
|
* |
ivan` sees a 'parent' on "Example JSON Block" |
07:19
🔗
|
JetBalsa |
its there, I see that now |
07:19
🔗
|
ivan` |
the page I linked links to that :/ |
07:19
🔗
|
JetBalsa |
GG |
07:19
🔗
|
|
remsen has quit IRC (Read error: Operation timed out) |
07:20
🔗
|
JetBalsa |
I read the 300k out of sentence backwards, sorrya about that |
07:20
🔗
|
|
MMovie1 has quit IRC (Read error: Connection reset by peer) |
07:22
🔗
|
|
MMovie has joined #archiveteam |
07:22
🔗
|
indrora |
you could theoretically bruteforce the entire reddit object ID space |
07:23
🔗
|
indrora |
Given that basically everything in reddit is a K:V |
07:23
🔗
|
|
remsen2 has quit IRC (Read error: Operation timed out) |
07:24
🔗
|
|
R5M has quit IRC (Leaving) |
07:28
🔗
|
godane |
SketchCow: i'm looking at the mac addict magazines |
07:28
🔗
|
godane |
and i think most of covers have to be rescan |
07:28
🔗
|
godane |
other then that there very good |
07:29
🔗
|
godane |
i only say that cause they looked a bit cut off on the left with alot of the covers |
07:51
🔗
|
|
bwn_ has joined #archiveteam |
07:57
🔗
|
|
tree3 has quit IRC (Read error: Operation timed out) |
07:57
🔗
|
|
JesseW has quit IRC (Leaving.) |
08:07
🔗
|
|
remsen has joined #archiveteam |
08:13
🔗
|
|
WinterFox has quit IRC (Read error: Operation timed out) |
08:16
🔗
|
|
WinterFox has joined #archiveteam |
08:45
🔗
|
|
midas1 is now known as midas |
08:45
🔗
|
midas |
yes! |
08:59
🔗
|
|
JetBalsa has quit IRC (Read error: Connection reset by peer) |
09:29
🔗
|
|
cadbury has quit IRC (Read error: Operation timed out) |
09:36
🔗
|
|
schbirid has joined #archiveteam |
09:46
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
10:22
🔗
|
|
wutno has joined #archiveteam |
10:24
🔗
|
|
WapCapLet has quit IRC (Read error: Operation timed out) |
10:36
🔗
|
|
blergh- has quit IRC (Remote host closed the connection) |
11:16
🔗
|
|
vitzli has joined #archiveteam |
12:07
🔗
|
PurpleSym |
xmc: Did the regex for gitorious work? |
12:58
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
13:01
🔗
|
|
dashcloud has joined #archiveteam |
13:14
🔗
|
|
Billy_ has joined #archiveteam |
13:14
🔗
|
|
Billy__ has joined #archiveteam |
13:15
🔗
|
* |
Billy__ slaps dashcloud around a bit with a large fishbot |
13:18
🔗
|
|
Billy_ has quit IRC (Ping timeout: 240 seconds) |
13:19
🔗
|
|
Billy__ has quit IRC (Ping timeout: 240 seconds) |
13:26
🔗
|
|
RichardG has joined #archiveteam |
13:28
🔗
|
|
REiN^ has joined #archiveteam |
13:36
🔗
|
|
melody has joined #archiveteam |
13:45
🔗
|
|
philpem has joined #archiveteam |
14:18
🔗
|
|
WinterFox has quit IRC (Read error: Operation timed out) |
14:20
🔗
|
|
WinterFox has joined #archiveteam |
14:36
🔗
|
|
Rickster has joined #archiveteam |
14:40
🔗
|
|
Stiletto has quit IRC () |
14:42
🔗
|
|
nertzy has joined #archiveteam |
14:59
🔗
|
|
Stiletto has joined #archiveteam |
15:02
🔗
|
K4k_ |
Anyone know if the BYTE magazine archive on archive.org is available in a zip or tar format somewhere? I'd like the whole collection but I don't want to have to get every issue seperately. |
15:05
🔗
|
Vito`__ |
K4k_: I don't know if there's a separate item for the whole thing, but you can use their API to get a whole collection: https://emerging.commons.gc.cuny.edu/2014/03/downloading-items-internet-archive-collection-using-python/ |
15:05
🔗
|
|
Vito`__ is now known as Vito` |
15:26
🔗
|
|
bauruine has quit IRC (Ping timeout: 252 seconds) |
15:27
🔗
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
15:56
🔗
|
|
andrewf has joined #archiveteam |
15:56
🔗
|
|
tree3 has joined #archiveteam |
16:03
🔗
|
|
andrewf has quit IRC (Quit: Page closed) |
16:33
🔗
|
|
bauruine has joined #archiveteam |
16:44
🔗
|
|
remsen2 has joined #archiveteam |
16:44
🔗
|
|
remsen2 has quit IRC (Remote host closed the connection) |
16:49
🔗
|
|
remsen has quit IRC (Read error: Operation timed out) |
16:52
🔗
|
|
Woflie has joined #archiveteam |
16:53
🔗
|
Woflie |
o7 http://furnation.com/ o7 *Bugles out some taps, disappears* |
16:53
🔗
|
|
Woflie has quit IRC (Client Quit) |
16:54
🔗
|
|
remsen has joined #archiveteam |
17:17
🔗
|
Atluxity |
would anyone care to help me find (if possible) the files for https://web.archive.org/web/20110302231052/http://wano.blip.tv/posts?view=archive&nsfw=dc |
17:18
🔗
|
|
VADemon has joined #archiveteam |
17:18
🔗
|
Atluxity |
I read somewhere that all of blip.tv was archived somehow |
17:18
🔗
|
Atluxity |
but I can't seem to find them |
17:18
🔗
|
arkiver |
I'll have a look at it |
17:19
🔗
|
Atluxity |
thanks |
17:19
🔗
|
Atluxity |
as far as I remember blip.tv also has an option for "upload this file to archive.org as well?" and I think that option was used |
17:22
🔗
|
arkiver |
so it looks like the IDs of the videos of https://web.archive.org/web/20110302231052/http://wano.blip.tv/posts?view=archive&nsfw=dc are not the same as on the normal blip.tv site |
17:22
🔗
|
arkiver |
We did not archive wano.blip.tv, we only archived blip.tv |
17:22
🔗
|
arkiver |
But if the videos from wano are also on blip.tv, they should be saved |
17:23
🔗
|
Atluxity |
I think they used to be blip.tv/wano |
17:23
🔗
|
Atluxity |
or simular |
17:26
🔗
|
arkiver |
I can't find the videos. The IDs are not the same as on blip.tv. |
17:26
🔗
|
arkiver |
It's from 2011 though |
17:26
🔗
|
arkiver |
It's very possible that these were already gone when we started archiving |
17:29
🔗
|
Atluxity |
right |
17:29
🔗
|
Atluxity |
ok, thanks for trying |
17:42
🔗
|
|
vitzli has quit IRC (Leaving) |
17:51
🔗
|
|
remsen2 has joined #archiveteam |
17:52
🔗
|
|
remsen2 has quit IRC (Remote host closed the connection) |
17:57
🔗
|
|
remsen has quit IRC (Read error: Operation timed out) |
18:07
🔗
|
|
remsen has joined #archiveteam |
18:09
🔗
|
|
remsen has quit IRC (Client Quit) |
18:10
🔗
|
|
remsen has joined #archiveteam |
18:12
🔗
|
K4k_ |
Vito`: Thanks, I didn't know they had an API for that kind of operation. I will look in to it! |
18:14
🔗
|
xmc |
PurpleSym: gitorious is ready but i have to do some sysadmining on the backend still |
18:19
🔗
|
|
nertzy has joined #archiveteam |
18:19
🔗
|
PurpleSym |
Alright. |
18:47
🔗
|
arkiver |
SketchCow: I'm not sure how much free space FOS has right now, but a lot of new FTP data is currently coming in |
18:54
🔗
|
SketchCow |
I am aware. We're holding out (under 50% use) but that 50% use is one day's downloads, so that's pretty involved. |
19:03
🔗
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
19:11
🔗
|
|
bwn_ has quit IRC (Read error: Operation timed out) |
19:25
🔗
|
|
arkiver2 has joined #archiveteam |
19:25
🔗
|
|
aaaaaaaaa has joined #archiveteam |
19:25
🔗
|
|
swebb sets mode: +o aaaaaaaaa |
19:28
🔗
|
|
xmc has quit IRC (Quit: brb rebooting) |
19:29
🔗
|
|
cadbury has joined #archiveteam |
19:32
🔗
|
SketchCow |
*** WHO HERE IS DKL3 ON ARCHVE |
19:35
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
19:36
🔗
|
arkiver |
I see DKL3 uploaded a lot of WARCs |
19:36
🔗
|
arkiver |
what's the problem with them? |
19:38
🔗
|
arkiver |
https://trello.com/dkl31 from Bibliotheca Anonoma |
19:40
🔗
|
|
bwn has joined #archiveteam |
19:42
🔗
|
|
xmc has joined #archiveteam |
19:42
🔗
|
|
swebb sets mode: +o xmc |
19:44
🔗
|
arkiver |
antonizoo might know more about who dkl3 is |
19:49
🔗
|
SketchCow |
No, no. |
19:49
🔗
|
SketchCow |
The upshot is they can upload WARCs but they're not going into Wayback. |
19:53
🔗
|
|
arkiver2 has joined #archiveteam |
20:00
🔗
|
|
BlueMaxim has joined #archiveteam |
20:00
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
20:04
🔗
|
|
arkiver2 has joined #archiveteam |
20:24
🔗
|
|
ndiddy has joined #archiveteam |
20:25
🔗
|
|
K4k_ has quit IRC (Quit: WeeChat 1.3) |
20:33
🔗
|
|
arkiver2 has quit IRC (Ping timeout: 252 seconds) |
21:01
🔗
|
|
JetBalsa has joined #archiveteam |
21:01
🔗
|
JetBalsa |
Whats the current status of Yuko project, I have not gotten any new items in 24hr |
21:07
🔗
|
JetBalsa |
Yuku* |
21:08
🔗
|
phuzion |
JetBalsa: Doesn't appear to be giving out new items at this time for one reason or another. |
21:24
🔗
|
JetBalsa |
Term1T3rm1n@l1! |
21:24
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
21:27
🔗
|
|
RichardG has joined #archiveteam |
21:30
🔗
|
|
arkiver2 has joined #archiveteam |
21:57
🔗
|
|
WinterFox has quit IRC (Read error: Operation timed out) |
22:00
🔗
|
|
WinterFox has joined #archiveteam |
22:06
🔗
|
|
Ghost_of_ has joined #archiveteam |
22:19
🔗
|
|
arkiver2 has quit IRC (Quit: Nettalk6 - www.ntalk.de) |
22:26
🔗
|
dashcloud |
JetBalsa: you may want to change that password now that it's out in public |
22:27
🔗
|
JetBalsa |
Its a temp that I give out to nerds, Its ment to be changed, but good thing its not on any public systems |
22:27
🔗
|
JetBalsa |
GG, Damn you keypass |
22:27
🔗
|
dashcloud |
just wanted to make sure you knew that it was pasted here |
22:27
🔗
|
JetBalsa |
Ya |
22:28
🔗
|
|
melody has quit IRC (Read error: Operation timed out) |
22:28
🔗
|
antonizoo |
Hello. I noticed that I was mentioned here. |
22:29
🔗
|
antonizoo |
Regarding, specifically, DKL3's WARCs. |
22:30
🔗
|
|
melody has joined #archiveteam |
22:31
🔗
|
antonizoo |
I can answer any questions about it, because we have wanted to contact the Internet Archive directly regarding the upload of these WARCs for a while |
22:33
🔗
|
antonizoo |
Wherever you'd like to ask please ping me |
22:33
🔗
|
antonizoo |
Or pm |
22:50
🔗
|
arkiver |
------------------------------------------ |
22:50
🔗
|
arkiver |
The Google Code project has started! |
22:50
🔗
|
arkiver |
Join #googlecodeblue |
22:50
🔗
|
arkiver |
------------------------------------------ |
22:50
🔗
|
Atluxity |
oh yeah |
23:01
🔗
|
|
Ghost_of_ has quit IRC (Quit: Leaving) |
23:01
🔗
|
|
Ghost_of_ has joined #archiveteam |
23:49
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
23:49
🔗
|
|
dashcloud has joined #archiveteam |