#archiveteam-bs 2017-12-17,Sun

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
godaneso i'm going to be reuploading the koreanet 1 chuncheon pg butitv
thats cause i screwed up the id names
[00:17]
***BlueMaxim has joined #archiveteam-bs [00:18]
JAAjoepie91: I'll have to do some more testing tomorrow, but I think I managed to port it correctly! :-) [00:19]
.... (idle for 18mn)
You're missing the pattern +!![] (== 1) in your script, by the way, and I think ++![] is a syntax error. [00:37]
.......... (idle for 49mn)
***ola_norsk has quit IRC (it's christmas! Drink your christmas beers https://youtu.be/NtV-EB8kvf8) [01:26]
.... (idle for 17mn)
LastNinja has quit IRC (Ping timeout: 260 seconds)
ZexaronS- has joined #archiveteam-bs
ZexaronS has quit IRC (Read error: Connection reset by peer)
[01:43]
Pixi has quit IRC (Ping timeout: 255 seconds)
Pixi has joined #archiveteam-bs
[01:53]
........ (idle for 38mn)
Stilett0 is now known as Stiletto [02:33]
Odd0002 has quit IRC (Quit: ZNC - http://znc.in) [02:46]
Odd0002 has joined #archiveteam-bs [02:53]
...... (idle for 29mn)
ola_norsk has joined #archiveteam-bs [03:22]
ola_norskif a WARC item is uploaded with 'Noindex: true' , (or even with 30-days 'test item'). Does it still go to waybackmachine?
regardless of how freakishly inclomplete it may or may not be
[03:24]
Somebody2ola_norsk: Only WARCs from trusted sources go into the Wayback Machine. "trusted" is a subtle and not-exactly-documented quality, though. [03:29]
ola_norskok [03:33]
....... (idle for 30mn)
***qw3rty112 has joined #archiveteam-bs
qw3rty111 has quit IRC (Read error: Operation timed out)
[04:03]
pizzaiolo has quit IRC (pizzaiolo) [04:14]
ivanSomebody2: not sure this is true any more, anyone can start uploading mediatype:web into Community Texts
I assume you can still get blacklisted
[04:17]
Somebody2ivan: Hm, neat.
ivan: have you verified that random mediatype:web items are included in the Wayback Machine, though?
[04:25]
ivanwell, mine are, but I doubt I got special treatment
try it and see!
[04:26]
ola_norskivan: i'm not risking blacklisted :D
ivan: e.g what could cause that, btw?
[04:27]
ivan1) cause what? 2) I have no idea probably [04:28]
ola_norsk< ivan> I assume you can still get blacklisted [04:29]
ivanI assume if someone at IA notices your WARCs are full of fake responses
or ISP/DNS content swaparoos
[04:29]
ola_norskbut e.g warcs from webrecorder should be ok i guess? [04:30]
ivanI would guess so [04:30]
ola_norskwould making them "NoIndex: true" prevent it from getting to waybackmachine?
webrecorder makes these 'patches', i mean, that'd i'd rather not see on my listing
preferably, i'd make it test items that are deleted after 30 days, as long as the warcs are processed by then
[04:31]
if "Noindex: true" could prevent it being shown as item, and it still got submitted to wayback, that would be 100% nice :D
e.g if that is/was the case, i could make an item containing the webrecorder warc, and it's 'patch' warcs i guess
[04:38]
ivanola_norsk: there's nothing wrong with having an item [04:46]
ola_norskivan: i have to look at it :D [04:46]
ivanI don't really look at my items [04:47]
ola_norskivan: and, it would be quite rudimentary, since i would'nt really bother with topics etc [04:47]
ivanjust tag with topic warcarchives [04:48]
ola_norskthen only one way to find out i guess :/ .. I just know the lack of thumbnail is going to be pester me though :D [04:51]
ivanI can only suggest OCDing about something else [04:53]
ola_norsk;) [04:53]
community web is not listed in ia browser uploader, is there argument for 'ia' tool to create an item of that sort?
i've never use 'ia' tool to create item, only upload to or alter
[04:58]
ivanno community web, only Community Texts, and I think it'll land there by default [05:00]
ola_norskk [05:00]
ivan#internetarchive [05:01]
***qw3rty113 has joined #archiveteam-bs [05:01]
ola_norskill just wing it with a dummyfile with random data in a test item and see where it lands when picking text [05:04]
ivanaren't test items going to land in the test collection [05:04]
***qw3rty112 has quit IRC (Read error: Operation timed out) [05:05]
ola_norski think that is a secondary entry of them [05:09]
ivancan't be in two collections [05:10]
ola_norskivan: https://archive.org/details/dummy_test_data [05:10]
ivanah, I totally forgot [05:11]
ola_norsklook at this messed up thing though :D https://archive.org/details/vidme_AfterPrisonJoe
'community data' :D
i'm thinking that's where random warcs might really belong at :D
it's not an option when uploading in browser though i think
might be what happens if making an item with 'ia' tool, without specifying anything, i cant remember
it's still in 'texts' collection, but mediatype is 'data' :DD
when using 'ia upload <somerandomitemid> *' , i mean
ivan: man, i just realized what you meant, i noticed first now the 'collection:' field on that dummyfile :D
sry
that could be 'collection: web' ?
[05:11]
ivan: should i un-gz the warcs first?
webrecorder downloads gzip'ed warcs it seems
[05:32]
ivanola_norsk: mediatype:web, not collection
don't un-gz
[05:33]
ola_norskk [05:33]
ivanno idea how to use ia but over the S3 interface it's header x-archive-meta-mediatype:web [05:34]
ola_norski tried 'ia upload ola_norsk_AGP_warcs theangrygranpa-20171217053032.warc.gz' , and it did upload
though, the item is (not yet) listed on my profile
adding the later 'patch' to that item seems to go as well
'ia ls ola_norsk_AGP_warcs'
seems to work fine :D , it's created the same derivs as when i added a warc to another item
and it's 'data' though :/
and mediatype can not be changed trough web ui
[05:39]
Somebody2ivan: I'm pretty sure you are whitelisted -- you are certainly known. The test would be to create a new account, and upload a WARC
from that, and see if it gets included.
(Or we could just ask, I suppose.)
(which I've now done)
[05:46]
ola_norskdoes it matter if 'mediatype' is set to 'web' on an item, for it to be applied to wayback ? [05:50]
Somebody2Yes, I'm pretty sure mediatype:web is required. [05:50]
ola_norskdang, how can https://archive.org/details/ola_norsk_AGP_warcs be changed from 'data' to 'web' ? [05:51]
ivaninfo@archive.org [05:52]
ola_norski was afraid of that would be the answer :D [05:52]
ivanactually, has anyone tried changing mediatype over S3? is the change ignored?
"only Archive admins can make that change." https://archive.org/post/1064443/change-media-type
[05:53]
ola_norskaye, and it can not be done trough web ui
ill type the mail later today when i've slep and sober i think, since also the afterprisonjoe item needs changing
for future reference, how might i specify 'mediatype: web' at upload with ia command line?
nevermind, i realize what ivan meant by header 'x-archive-meta-mediatype:web'
'ia upload --header=x-archive-meta-mediatype:web <item> <file>' i think
[05:57]
Somebody2No, I don't think that's right. [06:11]
ola_norsk-H, --header=<key:value>... S3 HTTP headers to send with your request. [06:11]
Somebody2You should use --metadata instead
--metadata=mediatype:web
The header form may work, too, though.
[06:11]
ola_norskhttps://github.com/vmbrasseur/IAS3API/blob/master/headers.md [06:13]
Somebody2Yeah, that suggests that either way should work. [06:15]
ola_norskok
i'm going to have to stay away from writing ia command lines i think, and just make e.g 'webiaarchive.sh' and 'videoiaarchive.sh' :D
better yet, an 'archivefordummy.sh' using dialog :D
[06:16]
Somebody2ha
Please do write them, yes.
[06:21]
ola_norskconsider it halfassed! https://youtu.be/ATBl4qH9I54
...(serioudly though, i'll try')
"What type of media would you like to upload?" ..kind of thing
[06:22]
***ola_norsk has quit IRC (kicked by ICANN for internetting under the influence) [06:25]
.... (idle for 18mn)
RichardG_ has quit IRC (Ping timeout: 255 seconds) [06:43]
...... (idle for 25mn)
kimmer12 has joined #archiveteam-bs [07:08]
kimmer1 has quit IRC (Read error: Operation timed out) [07:14]
kimmer1 has joined #archiveteam-bs [07:23]
kimmer13 has joined #archiveteam-bs
kimmer12 has quit IRC (Ping timeout: 633 seconds)
kimmer1 has quit IRC (Read error: Operation timed out)
kimmer1 has joined #archiveteam-bs
[07:28]
kimmer13 has quit IRC (Ping timeout: 633 seconds) [07:44]
........... (idle for 53mn)
ZexaronS- has quit IRC (Read error: Connection reset by peer)
ZexaronS- has joined #archiveteam-bs
[08:37]
........ (idle for 39mn)
ranmais nforce entertainment b.v in any way related to the old(?) nforce site?
the one with all the NFOs
[09:17]
.... (idle for 18mn)
***schbirid has joined #archiveteam-bs [09:35]
..... (idle for 24mn)
vantecSeems to be, but don't see them outright saying it anywhere. [09:59]
............... (idle for 1h13mn)
***schbirid has quit IRC (Quit: Leaving) [11:12]
...... (idle for 27mn)
BlueMaxim has quit IRC (Quit: Leaving) [11:39]
....... (idle for 34mn)
pizzaiolo has joined #archiveteam-bs
jschwart has joined #archiveteam-bs
[12:13]
.... (idle for 16mn)
odemg has quit IRC (Quit: Leaving) [12:33]
kimmer1 has quit IRC (Remote host closed the connection)
kimmer1 has joined #archiveteam-bs
icedice has joined #archiveteam-bs
[12:41]
JAAivan, Somebody2: My first (and so far only) upload a few weeks ago was included in the WM within a few hours, I believe. I don't know whether that was manually approved or not though. I'm pretty sure that the derive task ran immediately, but that doesn't really mean much I guess. [12:49]
***ZexaronS- has quit IRC (Quit: Leaving) [12:50]
JAAMrRadar, Sanqui: FYI, #msgbored is open again, we managed to cycle it. (I sent you an invite, but I guess you might've missed it.) [12:52]
..... (idle for 20mn)
***odemg has joined #archiveteam-bs [13:12]
.... (idle for 19mn)
LastNinja has joined #archiveteam-bs [13:31]
............. (idle for 1h0mn)
RichardG has joined #archiveteam-bs [14:31]
icedice2 has joined #archiveteam-bs [14:39]
icedice has quit IRC (Ping timeout: 506 seconds) [14:46]
....... (idle for 34mn)
icedice2 has quit IRC (Quit: Leaving) [15:20]
..... (idle for 20mn)
kimmer12 has joined #archiveteam-bs [15:40]
kimmer1 has quit IRC (Ping timeout: 632 seconds) [15:47]
.................. (idle for 1h28mn)
Stiletto has quit IRC (Read error: Operation timed out) [17:15]
....... (idle for 34mn)
Somebody2JAA: What's your account name on IA? [17:49]
...... (idle for 29mn)
***schbirid has joined #archiveteam-bs [18:18]
.... (idle for 18mn)
du_ has joined #archiveteam-bs [18:36]
........ (idle for 36mn)
Mateon1 has quit IRC (Ping timeout: 248 seconds)
Mateon1 has joined #archiveteam-bs
[19:12]
..... (idle for 22mn)
kimmer12 has quit IRC (Quit: Yaaic - Yet another Android IRC client - http://www.yaaic.org)
kimmer1 has joined #archiveteam-bs
Stilett0 has joined #archiveteam-bs
[19:34]
.... (idle for 15mn)
antomaticNgh. Good old ContentID. "This video has 11 seconds of grass and men and balls being kicked. Blocked worldwide!" .... [19:49]
.............. (idle for 1h9mn)
JAASomebody2: JustAnotherArchivist [20:58]
***BlueMaxim has joined #archiveteam-bs [21:01]
........... (idle for 51mn)
schbirid has quit IRC (Quit: Leaving) [21:52]
.... (idle for 19mn)
Somebody2Whoops, now I'm in the right channel. [22:11]
JAA:-) [22:11]
....... (idle for 33mn)
***jschwart has quit IRC (Quit: Konversation terminated!) [22:44]
pizzaiolo has quit IRC (pizzaiolo)
RichardG_ has joined #archiveteam-bs
[22:52]
godanebiography.com video urls don't download anymore : https://pastebin.com/dRhf6y8U
i figure i ask people here to see if anyone can fix it
[22:54]
***ndiddy_ has joined #archiveteam-bs [22:54]
godanelast time i download was from site was 2017-10-28 [22:54]
***K4k_ has joined #archiveteam-bs
ppsym has joined #archiveteam-bs
[22:55]
tuluu_ has joined #archiveteam-bs
RichardG has quit IRC (se.hub irc.underworld.no)
MrDignity has quit IRC (se.hub irc.underworld.no)
ndiddy has quit IRC (se.hub irc.underworld.no)
espes__ has quit IRC (se.hub irc.underworld.no)
tuluu has quit IRC (se.hub irc.underworld.no)
purplebot has quit IRC (se.hub irc.underworld.no)
PurpleSym has quit IRC (se.hub irc.underworld.no)
K4k has quit IRC (se.hub irc.underworld.no)
Rai-chan has quit IRC (se.hub irc.underworld.no)
i0npulse has quit IRC (se.hub irc.underworld.no)
medowar has quit IRC (se.hub irc.underworld.no)
espes___ has joined #archiveteam-bs
[23:02]
ppsym is now known as PurpleSym [23:20]
...... (idle for 25mn)
BlueMaxim has quit IRC (Read error: Connection reset by peer)
BlueMaxim has joined #archiveteam-bs
[23:45]
MrDignity has joined #archiveteam-bs [23:58]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)