#archiveteam-bs 2018-01-12,Fri

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***BlueMaxim has quit IRC (Leaving)
BlueMaxim has joined #archiveteam-bs
[00:07]
...... (idle for 28mn)
atrocityanybody running the seagate shingle drives?
8Tb for $200 isn't bad
and i have a pretty high write-one, read-many scenario
[00:35]
.... (idle for 17mn)
***dashcloud has joined #archiveteam-bs [00:52]
wabu has quit IRC (Read error: Operation timed out)
espes__ has quit IRC (Read error: Operation timed out)
chazchaz has quit IRC (Read error: Operation timed out)
[00:58]
wp494shingle drives - yuck
might as well shuck those wd easystores from bestbuy and save yourself $40 or more
[01:04]
***wabu has joined #archiveteam-bs
Valentine has quit IRC (Ping timeout: 506 seconds)
chazchaz has joined #archiveteam-bs
espes__ has joined #archiveteam-bs
[01:17]
atrocityare they all reds? [01:23]
i mean, knowing my use case, not sure it's bad
a 20Gb cache onboard, which i won't fill up immediately
[01:28]
godanei'm making a audio collection of Charlie Rose episodes i have
its for the myspleen people but could be uploaded as a way for a much smaller data collection of Charlie Rose
[01:31]
atrocityoh, a fellow ms user! [01:32]
godaneSketchCow: is there anyway to make all the podcasts collection files downloadable : https://archive.org/details/The_Jim_Rome_Show_Podcast-2005-01-03
there downloadable for me only
so everyone else just sees broken items with no files
[01:40]
so think a Charlie Rose mp3 collection would be like 250gb vs 2.5tb of video [01:54]
SketchCowhttps://archive.org/details/apkarchive
https://archive.org/details/ipaarchive building
https://archive.org/details/cdromsoftware discovered CD-ROMs in uploads that have graphics attached for seeing what the item is
[01:54]
godanealso metadata is in the mp3 files [01:54]
SketchCowhttps://archive.org/details/cdromimages discovered CD-ROMs in uploads (or in previous cd-rom collections) without graphics attached [01:54]
***Valentine has joined #archiveteam-bs [02:01]
godaneSketchCow: don't upload my video captures yet
i'm having trouble with one file right now
upload speed of FOS is very slow for some reason on my end
[02:03]
M-WillBraSketchCow: so remember like 6mo ago when you wanted an ftp.prserv.net archive? i downloaded most of it but wonder what the preferred format is for splitting things into smaller archive chunks and/or providing a file listing
zipping it all and uploading it all as one chunk is much like trying to fit an entire redwood tree into a woodchipper
[02:12]
***Valentine has quit IRC (Ping timeout: 506 seconds) [02:18]
jdude104 has joined #archiveteam-bs [02:29]
Odd0002 has quit IRC (Ping timeout: 260 seconds) [02:39]
SketchCowI've done that before
But how big is it
[02:42]
***Odd0002 has joined #archiveteam-bs [02:44]
..... (idle for 21mn)
jdude104 has quit IRC (Read error: Operation timed out)
jdude104 has joined #archiveteam-bs
[03:05]
M-WillBraSketchCow: I forget off the top of my head but I think dozens of gigs (and I'm on DSL)
More than 5 and less than 80
I was thinking about zipping each group of folders separately to get a rough split that doesn't require people to download the whole thing if they just want one bit
[03:10]
***godane has quit IRC (Read error: Operation timed out) [03:13]
.... (idle for 17mn)
Somebody280G is fine to upload all as one thing, but dividing it up would be OK too, IMO (but I'm nothing like an authority)
If Hiccup shows up again, point them in the direction of the IA Census.
[03:30]
***godane has joined #archiveteam-bs [03:40]
astridtry to keep items to 5 gigs, but you won't start running into issues until about 100g [03:41]
M-WillBraastrid: thanks, i think it's probably my side having issues more than anything. so i think i need to split it up but i wasn't sure if there was an archive/split/metadata format that was better than, say, a big .txt file with a recursive `ls` dump, etc [03:54]
.... (idle for 18mn)
***dashcloud has quit IRC (Ping timeout: 252 seconds)
dashcloud has joined #archiveteam-bs
[04:12]
dashcloud has quit IRC (Remote host closed the connection) [04:21]
.... (idle for 17mn)
Valentine has joined #archiveteam-bs [04:38]
qw3rty17 has joined #archiveteam-bs
qw3rty16 has quit IRC (Read error: Operation timed out)
[04:47]
Valentine has quit IRC (Ping timeout: 506 seconds) [05:00]
.... (idle for 15mn)
Valentine has joined #archiveteam-bs [05:15]
octothorp has quit IRC (Read error: Operation timed out) [05:27]
octothorp has joined #archiveteam-bs
Valentine has quit IRC (Ping timeout: 506 seconds)
[05:32]
jacketchahey, I have a dumb question
how do you generate the warc record ID?
[05:33]
***Valentine has joined #archiveteam-bs [05:35]
Valentine has quit IRC (Ping timeout: 506 seconds)
jdude104 has quit IRC (Read error: Operation timed out)
[05:43]
Valentine has joined #archiveteam-bs [05:52]
jacketchaI need it for the next update of my chrome extension, because liveweb can't capture past robots.txt [06:00]
Froggingcheck the spec [06:06]
..... (idle for 20mn)
***ReimuHaku has quit IRC (Ping timeout: 250 seconds)
ReimuHaku has joined #archiveteam-bs
[06:26]
.... (idle for 19mn)
jacketchaI did [06:47]
***Valentine has quit IRC (Ping timeout: 506 seconds)
Valentine has joined #archiveteam-bs
[06:49]
Pixi has quit IRC (Quit: Pixi)
Pixi has joined #archiveteam-bs
[07:03]
HCross2SketchCow: did you mean to add random images into https://archive.org/details/archiveteam_newssites items? [07:15]
.................. (idle for 1h27mn)
***tomaspark has quit IRC (Remote host closed the connection) [08:42]
..... (idle for 20mn)
JAAatrocity: For a read-heavy environment, shingled drives seem pretty much perfect. I haven't used them yet, but I plan to do so soon (waiting for the price to come down a bit more on this side of the pond). Regarding WD Red vs. white-labeled: the two are essentially identical except for the PWDIS feature in some of the white drives (which is easily fixable with a bit of tape if your machine doesn't sup
port it).
Heh, I remember now that we discuseed this before.
[09:02]
***tuluu has quit IRC (Ping timeout: 740 seconds)
tuluu has joined #archiveteam-bs
[09:16]
Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam-bs
[09:21]
..... (idle for 23mn)
ranavalon has joined #archiveteam-bs [09:45]
AeonG__ has quit IRC (Ping timeout: 633 seconds) [09:52]
BlueMaxim has quit IRC (Leaving)
REiN^ has quit IRC (Remote host closed the connection)
[09:58]
.... (idle for 16mn)
icedice has joined #archiveteam-bs [10:16]
..... (idle for 20mn)
altlabel has quit IRC (Read error: Operation timed out) [10:36]
ZexaronS has quit IRC (Quit: Leaving) [10:41]
.................. (idle for 1h27mn)
bwn has quit IRC (Read error: Operation timed out)
odemg has quit IRC (Leaving)
[12:08]
........... (idle for 53mn)
icedice has quit IRC (Ping timeout: 505 seconds) [13:02]
kevinr has quit IRC (Ping timeout: 250 seconds)
kevinr has joined #archiveteam-bs
odemg has joined #archiveteam-bs
[13:09]
............ (idle for 55mn)
Jonafternoon all
Jon pimps SketchCow's podcast on his blog
[14:07]
.... (idle for 18mn)
Sanquiarchiveteam is lacking a third piece of the triforce
irc for short term discussion, wiki for long-term information storage, where's the medium term
we're lacking a forum
channel offshoots like #msgbored with not much activity will never get anywhere without a place to discuss ideas where they won't disappear
[14:25]
***REiN^ has joined #archiveteam-bs [14:30]
SketchCowHCross2: Yes
Oh god not a forum
[14:39]
HCross2Ah thanks. I looked this morning and was confused [14:41]
SketchCowTrying to make the collections prettier and more defined.
Add descriptions to the collections that are clearer. Could use help.
[14:52]
JonSanqui, heh, mailing list?
or a usenet server.
I'd love to have to use usenet again.
[14:56]
Sanquithat might do but I sorta lack the tools, my email handling is horrible
I should get my thunderbird up and running again
[14:56]
Jonmodern versions of MailMan apparently have a web front-end for archives which is nice enough to fool people into thinking they are actually using a forum
although I haven't seen it deployed somewhere for real, which is telling.
[14:57]
Sanquido people still use thunderbird [14:58]
Jonyeah [14:58]
Sanquioki [14:58]
Jonbest of a bad bunch I think [14:58]
Sanquii do like gmail's mail grouping [14:58]
SketchCowOh god not a mailing list
Hey, Archivebot has not shoved a new batch into the archive in what looks like almost a day.
This is actually good news
[15:02]
JonSketchCow, so I take that as tacit approval of a usenet group? :P [15:07]
SketchCowoh god not a usenet group [15:07]
Jon:>
Jon , despite having been a UNIX sysadmin in a prior life, wouldn't know where to begin with that
[15:07]
SketchCowI do
But I will not do it
So, look.
There's a larger issue at scale. The easier we make it for everyone to hop into Archive Team on a "just stopping in" basis, the more we end up with
MELLONCHOLY: Hi guyz!!!!
MELLONCHOLY: Sure do hate things going away
MELLONCHOLY: I want to help
MELLONCHOLY: I read the docs
MELLONCHOLY: I have submitted 409gb of warc grabs
MELLONCHOLY: Ooops, done wrong
MELLONCHOLY: Anyone here like pokemon
MELLONCHOLY: I found a rare one
[15:07]
Jonlol [15:09]
SketchCowMELLONCHOLY: Also, I have added a bunch of new things to the wiki
MELLONCHOLY: Why r u all so mean
[15:09]
JonTo get back to Sanqui's point, tbh I think the wiki would be fine for forum-like stuff , i.e. Talk: pages [15:10]
SketchCowI think if there's stuff being discussed, the Wiki is best now, now that jrwr has really spiffed things up [15:10]
Sanquitalk pages are pretty bad, but I agree that there's no clearly good solution [15:11]
SketchCowTalk pages are just fine
You just need discipline.
Speaking of which, I have over 40 windows open on this desktop doing archiving
[15:11]
Jon\o/
point of order. For archiving a Blu Ray, which is legitimately archive-able (Creative Commons) *but* has AACS DRM; it makes sense to archive after decrypting right?
the legal risk is in the process of decrypting; not receipt of post-decrypted stuff.
I need to double check the decryption process is precise and there's not a fidelity issue too
[15:12]
SketchCowFOS has two partitions. One is at 6%, one is at 61%. The 61% is basically my stuff so I'm trying to nail it down. (It's "Just" 2tb of stuff)
Ha ha legal risk
[15:13]
Jonwell I mean DMCA or whatever [15:13]
SketchCowI'll visit you in Blu-Ray Jail [15:13]
Jonkind offer [15:13]
SketchCowI'll bring a cake with a file [15:13]
Jonthis is probably why the existing BD rip on archive.org I'm aware of was darkened [15:14]
SketchCowWell, if it was something recent and obvious, I'm sure a bot found it [15:15]
Jonit's http://archive.org/details/NineInchNailsGhostsI-Ivblu-ray24bit96khz but I've already mailed info@ to discuss it, so not asking you to do anything here, just if you were curious
CC-BY-SA-NC
[15:15]
SketchCowIt was darked 3.1 years ago
There, I undarked it
[15:16]
Jonyeah I'm slow to get back to this, I backburnered looking into it near the time
oh wow thanks
[15:17]
***ranavalon has quit IRC (Read error: Connection reset by peer)
ranavalon has joined #archiveteam-bs
[15:23]
Sanquiyeah I think thunderbird is just frozen trying to download my gmail inbox. [15:29]
JAAYou should see that in the status bar near the bottom. "Downloading header x of n" or something like that. [15:31]
SketchCowI forgot to mention that of those 40 screens, something like 15 of them are running an analytic process against archive.org stacks of materials to find cases where I unwittingly broke things and to unbreak them
Just a huge fucking cleanup crew
[15:33]
Jonis it satisfying to have the machine(s) doing work, though, right? getting value's worth from the silicon [15:35]
SketchCowWell, I'm not sad or anything [15:35]
zinoOh, National Archives space uploads: https://www.kickstarter.com/projects/420606009/fight-for-space-space-program-and-nasa-documentary/posts/2089150?ref=backer_project_update => https://archive.org/details/@paul_hildebrandt [15:36]
SketchCowI'm back to no hours in the day currently. [15:36]
***Stilett0 has quit IRC (Ping timeout: 260 seconds) [15:37]
zinoSketchCow, maybe ignore doing some things. No hours left might be bad health-wise. [15:37]
SketchCowha ha
The gal keeps on top of me
I went to bed at midnight like a prole last night
[15:37]
zino:-) [15:38]
SketchCowLike a rube, a huckleberry, a member of the masses without classes
One window is fixing the covers of the zines section
[15:38]
...... (idle for 26mn)
***Valentine has quit IRC (Ping timeout: 506 seconds) [16:07]
.... (idle for 18mn)
jrwrSketchCow: there is a addon I can add in that adds forum like threads to talk pages
they work like reddit comments so threaded
[16:25]
...... (idle for 26mn)
***Stilett0 has joined #archiveteam-bs [16:51]
...... (idle for 25mn)
schbirid has joined #archiveteam-bs [17:16]
schbirid has quit IRC (Ping timeout: 252 seconds)
Harzilein has quit IRC (Quit: ircII EPIC5-2.0.1 -- Are we there yet?)
schbirid has joined #archiveteam-bs
C4K3 has quit IRC (Read error: Connection reset by peer)
C4K3 has joined #archiveteam-bs
[17:25]
PurpleSymSketchCow: Can I create new collections with these new archive.org superpowers? The NIN remixes would deserve one, imo: https://archive.org/search.php?query=subject%3Aremix.nin.com%20mediatype%3Aaudio%20subject%3Aarchiveteam [17:37]
............... (idle for 1h14mn)
***jdude104 has joined #archiveteam-bs [18:51]
SketchCowWho is DKL3. [19:02]
..... (idle for 22mn)
***medowar has quit IRC (Ping timeout: 252 seconds)
purplebot has quit IRC (Ping timeout: 252 seconds)
HCross2 has quit IRC (Ping timeout: 252 seconds)
odemg has quit IRC (Ping timeout: 252 seconds)
Rai-chan has quit IRC (Ping timeout: 252 seconds)
odemg has joined #archiveteam-bs
purplebot has joined #archiveteam-bs
medowar has joined #archiveteam-bs
HCross2 has joined #archiveteam-bs
svchost03 sets mode: +o HCross2
[19:24]
Rai-chan has joined #archiveteam-bs [19:39]
ndiddy has quit IRC ()
ndiddy has joined #archiveteam-bs
[19:44]
.... (idle for 17mn)
espes__ has quit IRC (Read error: Operation timed out) [20:01]
SketchCowStand down, tracked, thanks [20:09]
ola_norskSomebody2: I'm wondering if you ever got a response back on the email you sent to Norsk Dataforening? (i apologize if i mistake you for another user here) [20:20]
.... (idle for 16mn)
***espes__ has joined #archiveteam-bs [20:36]
.... (idle for 17mn)
icedice has joined #archiveteam-bs [20:53]
Kimmer has joined #archiveteam-bs
qw3rty17 has quit IRC (Nettalk6 - www.ntalk.de)
[20:58]
icedice has quit IRC (Quit: Leaving) [21:10]
....... (idle for 32mn)
Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam-bs
[21:42]
..... (idle for 20mn)
Jusque has quit IRC (ZNC - http://znc.in) [22:02]
godane has quit IRC (Read error: Operation timed out)
Jusque has joined #archiveteam-bs
Jusque has quit IRC (Client Quit)
Jusque has joined #archiveteam-bs
[22:10]
..... (idle for 23mn)
Somebody2ola_norsk: Nope, no response yet. [22:38]
ola_norskSomebody2: aye, though there might have been some reorginazation going on there in the few weeks. Torp now seems to be listed as "general secretary".. http://www.dataforeningen.no/ansatte.134521.no.html ...It's typical fucking beurocracy [22:43]
Somebody2Eh, like I said -- my message was pretty much focused on "you don't need to reply to this" -- so not getting a reply just means they understood it. :-) [22:45]
***godane has joined #archiveteam-bs [22:45]
ola_norska "thank for your input" would've been courteous though [22:45]
***godane has quit IRC (Quit: Leaving.)
godane has joined #archiveteam-bs
[22:53]
ola_norsk"Wanting people to listen, you can't just tap them on the shoulder anymore. You have to hit them with a sledgehammer, and then you'll notice you've got their strict attention."
..or an open letter, signed by academics
[22:59]
***ola_norsk has left [23:00]
godanei'm uploading tons of odd recordings i have
this is so i can get rid of them once jason uploads them to archive.org
one of the odd ones is a recording of WMUR News9 At 11 on 1997-02-23
SketchCow: your getting a old local news recording from my personal tape collection
[23:09]
***qw3rty15 has joined #archiveteam-bs [23:23]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)