#archiveteam-bs 2017-12-12,Tue

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***BlueMaxim has quit IRC (Read error: Connection reset by peer)
marvinw is now known as ivan
[00:07]
ivanJAA: want to write another web crawler using https://github.com/GoogleChrome/puppeteer? ;) [00:12]
JAAivan: PurpleSym has been working on something like that. (Not with Node though, fortunately.)
https://github.com/PromyLOPh/crocoite
[00:15]
ivancool [00:17]
JAAI'm still considering building a minimal tool with aiohttp and warcio though, primarily for relatively simple stuff like APIs where the heavy machinery of a browser isn't needed.
But for now I'm busy rescuing the crashed CompuServe ArchiveBot job.
[00:19]
CoolCanukwhy anothe web crawler/ [00:22]
JAAThe browser-based one because wpull doesn't handle heavily scripted websites well. It also means we can support HTTP/2, which may improve performance as a side-effect. Furthermore, the traffic will look much more real, so it might help getting around (some) bans.
The other one because wpull has a ton of ugly bugs that make it quite annoying to work with at times.
[00:26]
CoolCanuk:/ [00:26]
***bithippo has joined #archiveteam-bs [00:26]
JAAAnd I think aiohttp+warcio would also be more lightweight than wpull. It would be a quite simplistic and specialist tool for certain use cases only, in particular API archiving (I've done a few of those with wpull).
I have no intention of rebuilding wpull.
That time would be better spent in debugging wpull instead. It's a great tool, it just suffers from a number of issues that make it barely usable really.
[00:27]
.... (idle for 18mn)
godaneso i did 2 tapes
one is Loch Ness Discovered tape
another is called Titanic
that aired on A&E
both are from 1994
[00:45]
***pizzaiolo has joined #archiveteam-bs [00:49]
ZexaronS- has joined #archiveteam-bs
ZexaronS has quit IRC (Read error: Operation timed out)
[00:54]
JAAI'm impressed. My IP is still banned at Wine.Woot (since three weeks now). [01:00]
....... (idle for 33mn)
***bithippo has quit IRC (My MacBook Air has gone to sleep. ZZZzzz…)
zalgo has joined #archiveteam-bs
zalgo has quit IRC (Remote host closed the connection)
zalgo has joined #archiveteam-bs
[01:33]
.... (idle for 18mn)
pizzaiolo has quit IRC (Remote host closed the connection)
zalgo has quit IRC (Remote host closed the connection)
[02:03]
.... (idle for 16mn)
ivanI use grab-site and wpull 1.2.3 all the time and while it's useful only like 90% of the time I wouldn't call it barely usable
sure it would be nice to archive everything but there's plenty of stuff it can archive
without it maybe you'd be dealing with heritrix or httrack :-)
[02:22]
JAAOh yeah, 1.2.3 is pretty good. The plugin interface of 2.0 is much better though.
And yes, it's definitely the best tool we have.
(I'm using 1.2.3 for most of my manual grabs as well.)
[02:27]
...... (idle for 26mn)
***ZexaronS- has quit IRC (Leaving) [02:54]
...... (idle for 28mn)
CoolCanuk has quit IRC (Quit: Connection closed for inactivity) [03:22]
BlueMaxim has joined #archiveteam-bs [03:36]
....... (idle for 33mn)
qw3rty116 has joined #archiveteam-bs [04:09]
qw3rty115 has quit IRC (Read error: Operation timed out) [04:15]
..... (idle for 20mn)
ranavalon has quit IRC (Remote host closed the connection)
ranavalon has joined #archiveteam-bs
[04:35]
.............. (idle for 1h7mn)
Dimtree has quit IRC (Read error: Operation timed out) [05:43]
.... (idle for 18mn)
Dimtree has joined #archiveteam-bs
Dimtree has quit IRC (Client Quit)
Dimtree has joined #archiveteam-bs
[06:01]
....... (idle for 34mn)
Dimtree has quit IRC (Read error: Operation timed out)
Pixi has quit IRC (Ping timeout: 255 seconds)
Pixi has joined #archiveteam-bs
[06:37]
Mateon1 has quit IRC (Remote host closed the connection)
kimmer2 has quit IRC (Ping timeout: 633 seconds)
Mateon1 has joined #archiveteam-bs
[06:49]
kimmer2 has joined #archiveteam-bs [07:00]
omglolbah has quit IRC (Ping timeout: 250 seconds)
tuluu has quit IRC (Read error: Operation timed out)
tuluu has joined #archiveteam-bs
Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam-bs
Dimtree has joined #archiveteam-bs
[07:10]
Asparagir has quit IRC (Asparagir) [07:27]
........ (idle for 35mn)
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam-bs
[08:02]
PurpleSymJAA: Distributed archiving using celery and an IRC bot are functional, but not checked in yet. I’m setting up a testing environment currently. [08:14]
.... (idle for 19mn)
***du_ has quit IRC (Ping timeout: 260 seconds) [08:33]
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam-bs
[08:42]
................... (idle for 1h31mn)
godane has quit IRC (Read error: Operation timed out) [10:14]
godane has joined #archiveteam-bs [10:25]
....... (idle for 32mn)
pizzaiolo has joined #archiveteam-bs [10:57]
......... (idle for 43mn)
BlueMaxim has quit IRC (Quit: Leaving) [11:40]
.... (idle for 16mn)
refeed has joined #archiveteam-bs
refeed has quit IRC (Client Quit)
refeed has joined #archiveteam-bs
[11:56]
_refeed_ has joined #archiveteam-bs
refeed has quit IRC (Read error: Connection reset by peer)
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam-bs
[12:06]
__refeed_ has joined #archiveteam-bs
_refeed_ has quit IRC (Read error: Connection reset by peer)
[12:20]
_refeed_ has joined #archiveteam-bs
__refeed_ has quit IRC (Read error: Connection reset by peer)
refeed has joined #archiveteam-bs
refeed has quit IRC (Connection closed)
_refeed_ has quit IRC (Read error: Connection reset by peer)
refeed has joined #archiveteam-bs
refeed has quit IRC (Client Quit)
[12:29]
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam-bs
[12:48]
..... (idle for 21mn)
Nugamus has joined #archiveteam-bs [13:10]
godane has quit IRC (Quit: Leaving.) [13:18]
TheLovina has quit IRC (Read error: Operation timed out) [13:23]
......... (idle for 42mn)
du_ has joined #archiveteam-bs [14:05]
.... (idle for 16mn)
K4k has quit IRC (Ping timeout: 260 seconds) [14:21]
K4k has joined #archiveteam-bs
Stilett0 has joined #archiveteam-bs
K4k has quit IRC (Ping timeout: 250 seconds)
[14:29]
JAASketchCow: Another random "a" file in item archiveteam_archivebot_go_20171210190001 (see my messages in here about three days ago). This time, it's a WARC again.
See also #archivebot
[14:35]
***K4k has joined #archiveteam-bs [14:40]
JAAFigured it out, it was a typo on one of the ArchiveBot pipelines. I hope it didn't overwrite any files.
I'll send you a list of what each of those files should be named via email.
[14:48]
.... (idle for 16mn)
***Mateon1 has quit IRC (Remote host closed the connection)
Stilett0 is now known as Stiletto
[15:05]
...... (idle for 26mn)
DrasticAcOkay, my first pass of the parsed Miiverse Database finished up over the weekend. I have to redownload some WARCs from IA that were corrupted and validate one more time to make sure I got everything, but it's a pretty good representation of what we got last month. Total size so far of the database is 191 GB.
We saved 2,117,420 deleted posts (by marking their non-existence), 128,727,868 posts, and 206,473,819 replies.
We saved 69,955,548 drawings, of which 27,086,692 were posted in replies, which Nintendo didn't send to users in their "archive".
Likewise, we saved 72,985,588 screenshots, with 14,089,107 posted in replies.
Here's a spreadsheet of the amount of posts and replies per day. https://usercontent.irccloud-cdn.com/file/UJeNuOnR/TotalTimestamp.xlsx
Once I validate the database, I'll throw it on IA.
[15:32]
***refeed has joined #archiveteam-bs
refeed has quit IRC (Connection closed)
refeed has joined #archiveteam-bs
[15:44]
_refeed_ has joined #archiveteam-bs
refeed has quit IRC (Read error: Connection reset by peer)
[15:56]
....... (idle for 31mn)
__refeed_ has joined #archiveteam-bs
refeed has joined #archiveteam-bs
_refeed_ has quit IRC (Read error: Connection reset by peer)
__refeed_ has quit IRC (Read error: Connection reset by peer)
[16:31]
SketchCowJAA: HI
Yeah, I'm all for fixing these after the fact
FOS is dying under this Manga Plus Everything Else
[16:38]
***_refeed_ has joined #archiveteam-bs [16:39]
SketchCowSo I'm going to just upload and then we'll issue fixes and re-checks [16:39]
IglooSketchCow: i've raised the first pull to fix the 2.0.3 issue
I'm writing the changes to uploader now
[16:39]
***refeed has quit IRC (Read error: Connection reset by peer) [16:42]
SketchCowGreat
But yeah, we are due lots of audit of already uploaded shiznat, frankly
I do it when I can
Right now, though, FOS is definitely heaving and I'm trying to fix that
Some of it is just because it's the new FOS and automatic upload processes weren't running
Other is MANGA
MANGA baby
[16:45]
JAASure, we can fix that down the line as well.
Can we prevent files being overwritten though?
We probably lost about a third of Cracked due to that, for example.
[16:47]
SketchCowIs cracked deleted?
I mean, we can re-do it
[16:48]
JAANo, it's still online. [16:49]
SketchCowBut everything I see is that it is on the rsync command side, not the server side.
So some --ignore-existing or something
[16:49]
JAAHmm. [16:49]
SketchCowSo it's the pipeline, really [16:49]
JAAYeah, it should be possible to do something there. [16:49]
SketchCowI don't THINK there's an rsyncd setting (or rsyncd.conf setting) to stop overwriting, but I'll look
https://download.samba.org/pub/rsync/rsyncd.conf.html
If someone sees something, I'll toss it in
I mean, I looked and it REALLY looks like it's on the rsync command's side to not be the asshole
[16:49]
JAAMaybe something could be done with "filter".
"Files excluded by the daemon filter chain (daemon-excluded files) are treated as non-existent if the client tries to pull them, are skipped with an error message if the client tries to push them (triggering exit code 23), and are never deleted from the module."
That filter would have to be updated with the current file list after every upload, I think.
[16:54]
SketchCowThis is too much work.
Client just needs a new setting.
[16:56]
***__refeed_ has joined #archiveteam-bs [16:56]
sec0ndhttps://storify.com/faq-eol [16:57]
SketchCowha ha wow [16:59]
***_refeed_ has quit IRC (Read error: Connection reset by peer) [17:00]
JAASketchCow: I can come up with a number of ugly workarounds. For example, periodically moving the files to a different directory with mv -n. The rsync target directory would just be an "inbox". If that move command is running every few minutes, the probability of anything being overwritten should be very small. [17:02]
***CoolCanuk has joined #archiveteam-bs [17:02]
JAAOr chattr +i every file. (Needs to be undone to delete it after upload to the IA systems though, obviously.) [17:02]
CoolCanukI heard JAA's IP is banned from wine.woot [17:03]
JAAIn any case, yes, we should add a flag to the client side, but I'm not a fan of relying on that. [17:03]
SketchCowI think that if we don't rely on the clients being functional, we're sunk anyway [17:03]
JAAWell yeah, but the clients should only be able to do what they're supposed to do, i.e. push new data onto FOS. [17:04]
joepie91how about just using `incoming chmod` to set `ugo-w` for each transfer, preventing further writes to that file?
not sure if that'll work
[17:05]
***K4k has quit IRC (Read error: Operation timed out) [17:05]
joepie91it doesn't seem to have an incoming chown option or I'd have suggested changing its owner to a non-rsync owner [17:06]
JAAI'd be surprised if it did, to be honest. [17:06]
joepie91worth a shot? [17:06]
JAAIt's the write permission of the directory that dictates whether you're able to overwrite a file. [17:06]
joepie91I... don't think so? [17:06]
JAAtouch file && chmod -w file && rm -f file
I don't think rm will restore the write permission before it deletes the file.
Deleting a file simply means removing the directory entry, so...
[17:07]
***sep332 has joined #archiveteam-bs [17:07]
JAAAnd you *can* overwrite read-only files with rsync, so... [17:07]
joepie91hm [17:07]
IglooJAA: i'll update the uploader.py too [17:10]
JAAThanks [17:10]
***bwn has quit IRC (Read error: Operation timed out) [17:11]
Cyn_ has joined #archiveteam-bs
BnAboyZ has quit IRC (Quit: The Lounge - https://thelounge.github.io)
[17:22]
..... (idle for 24mn)
icedice has joined #archiveteam-bs
schbirid has joined #archiveteam-bs
[17:48]
_refeed_ has joined #archiveteam-bs [17:56]
__refeed_ has quit IRC (Read error: Connection reset by peer) [18:01]
K4k has joined #archiveteam-bs [18:07]
.... (idle for 18mn)
icedice has quit IRC (Read error: Operation timed out)
icedice has joined #archiveteam-bs
icedice2 has joined #archiveteam-bs
icedice has quit IRC (Ping timeout: 260 seconds)
[18:25]
.... (idle for 19mn)
bwn has joined #archiveteam-bs
__refeed_ has joined #archiveteam-bs
[18:54]
_refeed_ has quit IRC (Read error: Connection reset by peer) [19:01]
hook54321Does anyone know if myspleen invites are open? [19:02]
***icedice has joined #archiveteam-bs
icedice has quit IRC (Client Quit)
icedice2 has quit IRC (Read error: Operation timed out)
[19:15]
omglolbah has joined #archiveteam-bs [19:30]
jschwart has joined #archiveteam-bs
ola_norsk has joined #archiveteam-bs
[19:37]
ola_norskdoes anyone know if "<item>/history/file/*~1~" serve a special purpose on internet archive? [19:38]
astridexample? [19:38]
ola_norskthe 'history' folder does not seem to show up in item editor [19:39]
astridpls link to an example, i have not seen this before [19:39]
ola_norsk1 sec
in the item sowhat-vidme_archive, "ia ls <item>" returns several files listed with such a directory (history/files/), e.g the file "history/files/20171111_YouTubers explain YT Censorship_Iv3WZ.description.~1~"
and the ".~1~" seems to have been added to original filename
filenames*
[19:39]
astridlooks like it retains changes in a directory /history/files/ https://archive.org/download/sowhat-vidme_archive/history/files/
but other items that i uploaded and modified don't have the directory
[19:45]
ola_norskfirst time i've seen it [19:46]
astridi know .~1~ is used as a version suffix for backup files that emacs and some other tools create [19:46]
ola_norskis it possible it's just a case of wrong detection of file format?
or do you mean e.g -re-uploaed files?
[19:47]
astridthinking the second one [19:47]
IglooOn that job a lot of work was re-done
So likely
[19:48]
ola_norskthe guy apparently has 1080+ videos on his vidme
so 'a lot of work' might also make sense :D
[19:48]
IglooWe've done somewhere around 70TB of data from vid.me
It's around 1.2PB in size
[19:49]
ola_norski simply focused on some channels with videos that i know are not on youtube.. E.g "AfterPrisonJoe" from "AfterPrisonShow" on youtube
wasn't it 600TB ?
[19:50]
IglooThat was our estimate. We got hold of someone who worked there, It's a lot more... [19:51]
ola_norskdoh
maybe 'job scripts' should simply be made the moment a new service pops up :D
though there's no doubt a majority of that 1.2PB is most likely already existing on youtube
[19:51]
***_refeed_ has joined #archiveteam-bs
__refeed_ has quit IRC (Read error: Connection reset by peer)
[19:56]
ola_norska bit is the users fault though, as i see it. There's e.g no reason for a having a news commentary video in 1080p, max bitrate, and at 60fps..
example, the 'reading scary creepypasta stories'
most often composed of someone narrating over what's basically slideshows of still images
[20:01]
JAAThat should compress extremely well though. [20:04]
ola_norskyeah, but if there's smoke effects over it [20:04]
***__refeed_ has joined #archiveteam-bs [20:06]
ola_norski don't know how compression on vidme worked. But most of the files in the item i mentioned at join are basically the person browsing german newspapers while delivering english commentary. Some are 1 hour +
and i think i saw one of those 1+h videoes being 1+ GB
i'm not saying i pirate movies, but 1.3GB seems to be well for a 1080p movie
[20:08]
***_refeed_ has quit IRC (Read error: Connection reset by peer) [20:10]
Frogging1.3GB for a full length movie at 1080p is bitrate starved [20:12]
ola_norsk'starved' ? [20:14]
Aoedeit looks like shit [20:20]
Froggingyeah, as in, not enough bitrate to maintain a high level of quality at that resolution, at least for a live action movie. if a 1.3GB video is 2 hours long, the average bitrate is about 1.4Mbps. it's subjective and does depend on the codec, but to me it seems low for 1080p. [20:20]
ola_norsk"eye of the beholder then" :) [20:21]
zinoIt measurably bad in the psychovisual model. [20:21]
ola_norsk'measurably worse'...'bad' is relative
but anyway, i'm not codec expert :D
[20:22]
JAAWho doesn't like nice blocks and banding? /s [20:23]
FroggingJAA: VP9 pisses me off with how much banding it creates in dark scenes
even at relatively high nitrates
bit
[20:24]
JAADon't have much experience with VP9. [20:29]
Froggingfeel like it's a step backwards from H.264 at least in that record
regard* (autocorrect)
[20:29]
JAAUhm, did IA just change the /save page? [20:30]
ola_norsk:/
web.archive.org/save ?
[20:30]
JAAI'm getting a page with a textarea "List of archived elements in reverse order" instead of the usual save page.
Yeah
[20:31]
ola_norski hope not [20:31]
JAAIt also doesn't actually archive anything for me.
As far as I can tell anyway.
[20:31]
IglooShows same as normal to me [20:31]
ola_norskas long as it's just the looks [20:32]
***Coderjo has quit IRC (Read error: Operation timed out)
yipdw has quit IRC (Read error: Operation timed out)
[20:33]
ola_norskhow else could i be capturing rather repetitiv tweets about #netneutrality :/ [20:33]
Igloo#archivebot
;-)
[20:33]
***me has joined #archiveteam-bs [20:33]
ola_norskwouldn't that mean i had to download it first? [20:33]
JAAhttps://share.riseup.net/#tuQ-OKPcwjop11jI_XzhzA [20:34]
IglooNope, Just feed it URLs [20:34]
JAAIt's just sitting there forever. [20:34]
IglooAnd then it does the rest [20:34]
MrRadar2For Twitter, make sure you use --phantomjs with archivebot
Since Twitter uses JS for pretty much everything
[20:34]
ola_norskit would autoscroll? [20:34]
IglooIndeed [20:34]
JAANo, because it's broken. [20:34]
IglooUse the right pipeline and it works ;-) [20:35]
JAAWell, yeah.
Does it work on yours?
[20:35]
IglooOn CA it does
I haven't tested the rest
[20:35]
JAALet's move this to over there. [20:36]
***atomicthu has quit IRC (Ping timeout: 260 seconds) [20:36]
JAANope, can't save anything into the Wayback Machine right now. Ew. [20:36]
***atomicthu has joined #archiveteam-bs
kimmer1 has joined #archiveteam-bs
Coderjo has joined #archiveteam-bs
kimmer12 has quit IRC (Read error: Operation timed out)
[20:38]
CoolCanuk:O
did WE break it?!?
[20:46]
JAALol no [20:47]
Frogging/save/ appears broken for me too [20:48]
hook54321same [20:52]
***_refeed_ has joined #archiveteam-bs [20:56]
JAAGood to see that I'm not the only one. I've sent an email to info@ just in case they don't know about it already. [20:57]
***__refeed_ has quit IRC (Read error: Connection reset by peer) [21:00]
_refeed_ has quit IRC (Leaving) [21:11]
hook54321Somebody2: I noticed in #archivebot logs a few months ago you tagged me and a few other people and said that the owner of autistics.org wanted to pass the site onto someone else, and that we could redirect stuff to pages on the wayback machine. If that's still open I'd be interested in that. [21:20]
.... (idle for 16mn)
ola_norskthe problem with 'tweep' is that it seems to be oriented to specified user account
oooops', no it doesn't "python tweep.py -s pineapple - Collect every Tweet containing pineapple from everyone's Tweets."
[21:36]
JAAYep, and it looks like you can just search for the hashtag there.
(For those out of the loop, we're talking about https://github.com/haccer/tweep )
[21:40]
ola_norskwhat if it just bounce around on twitter domain, checking every tweet? :D that could take some time :D [21:44]
JAAHave fun with that. [21:45]
ola_norskskål! [21:45]
....... (idle for 32mn)
JAAGot a reply from Mark, they're "testing an update to page of our Save Page Now service". He asked me to try again, and my saves went through this time. [22:17]
***Cyn_ has quit IRC (Ping timeout: 262 seconds) [22:23]
ola_norsktweep appears to be broken :/ [22:25]
antomaticNyaaaa! YouTube won't let accounts upload more than 100 videos per day.
Progress on the news archive is gonna be slow. :)
[22:25]
ola_norskantomatic: at least that's not a sign it's all going to hell :) [22:27]
***schbirid has quit IRC (Quit: Leaving)
PoorHomes has joined #archiveteam-bs
PoorHomie has quit IRC (Quit: No)
[22:30]
ola_norskit's about high time users adopt a pessim-op-tic view of "the internet" https://youtu.be/1VD_pJOFnZ0 [22:33]
antomaticIf I upload 100 items each and every day, then I think I can get everything up to the end of 2017 uploaded... by the end of 2018. :) [22:33]
***PoorHomes is now known as PoorHomie [22:33]
ola_norskantomatic: if i had to guess, the cause might be users of several nations trying to reupload their catalog that they moved to vidme [22:34]
antomaticHeh. I think it's just a generic dont-spam-us-too-hard measure. :) [22:35]
ola_norskantomatic: that, or youtube's wallet is getting a bit scrawny for Google to be happy with.. :/ [22:36]
antomaticThis is a waste of time anyway with the 3 strikes rule, because /someone/ is bound to object hard enough to it and the whole lot comes down
still, gives the opportunity to trim the recordings nicely, collate some metadata and run out the caption transcripts.
[22:36]
hook54321antomatic: Can you upload them to archive.org? [22:37]
ola_norskthe entire internet needs a trimming, there's too much monopoly going on [22:38]
antomaticI don't know if archive.org necessarily wants 30tb+ of news bulletins though. :) [22:38]
astriddid you email info@ like we discussed [22:39]
antomaticdid we discussed? sorry, must have missed that [22:39]
hook54321Don't they already record tv news though? [22:40]
antomaticUS/International yes, I think, but I don't know if they record UK domestic [22:40]
astridyeah we spoke about this a few days ago [22:41]
***pizzaiolo has quit IRC (Ping timeout: 246 seconds) [22:41]
astridlook in your logs around 5th december
you and i
[22:41]
hook54321I asked Jason if IA records Catalan TV once and he said "ish" [22:42]
ola_norsk'so and so' recording? [22:43]
antomaticastird: ah, yes, I did see that.
thx
wanted to check whether I had a decent process for transcoding & transcribing it all first
[22:43]
ola_norskwhere can i find which channels IA records? most of NRK (norwegian state television) is nation locked [22:44]
antomaticoops, sorry, astrid. thanks. typo. :) [22:45]
astridIA will transcode it
transcription is something else
[22:45]
ola_norskyoutube caption autogenerated is horrible :( [22:46]
antomaticMm, some of the source files are a little proprietary, but I can bounce them out to .TS which I believe IA accepts
I can extract the live captioning to SRT, which should be /something/
[22:46]
astridoh that's good [22:47]
antomaticalthough the accuracy is a little ropey. seems to have improved some over the last few years
YouTube's autosubs are actually *amazingly* good - in their own way - compared to the broadcast subs
[22:47]
ola_norsknot only is it vtt, but their translatiation is shit. Even at rather clean english (i think?) https://youtu.be/fpZD_3D8_WQ [22:48]
antomaticBut they totally fall to bits when background noise reaches a certain level [22:48]
ola_norskah... [22:48]
antomaticThey've improved *enormously* over the last year or so [22:48]
ola_norskthat video got that :D [22:48]
antomaticstill a long way from perfect though [22:48]
***Pixi has quit IRC (Ping timeout: 255 seconds) [22:49]
JAAhook54321: CCMA? [22:49]
***Pixi has joined #archiveteam-bs [22:50]
ola_norskantomatic: and yet, there's Google Deepmind AI able to master chess after just 4 hours..
ola_norsk is not impressed
if i was more Alex Jonesy, i'd be likely to say that when these supermachines beat human champs in Chess and Go, it's got to do with money paid to the looser...
[22:51]
antomaticDid you see that video of Alex Jones 'interviewing' an Amazon Echo? :) [22:54]
ola_norsklol no
I like Alex 'coloidal silver niacent' Jones :D
[22:54]
antomaticHe's all like 'Alexa, who is Jeff Bezos?', 'Alexa, you are lying to me', etc. Nuts. :) [22:55]
ola_norskAlex's got 'Brain Power' to prevent being beaten by any disinformation or misleading [22:56]
antomaticHehe. :) [22:57]
MrRadar2Do you have a link to that "interview", sounds hilarious [22:57]
ola_norskaye, givvus link [22:57]
antomatichttps://www.youtube.com/watch?v=u5kNP7tyhk8 [22:58]
MrRadar2thx
I'll be watching that in a private browsing window so as to not pollute my Youtube history
[22:59]
***JAA sets mode: +b *!*@185.143.40.157 [22:59]
JAA(Just a preventive ban for a spammer that showed up in #vidmeh and #miiworse) [23:00]
ola_norsklol, i love how he's stern when saying "Alexa! ; .." ..like talking to a little girl child :D [23:00]
MrRadar2"Alexa, I have mainstream news articles that Amazon is owned by the CIA" XD [23:00]
ola_norsk"Aleeexaaaa; did you steal <item> out of fridge?" [23:01]
***BlueMaxim has joined #archiveteam-bs [23:01]
MrRadar2LOL, Alex Jones fails the Turing test [23:02]
antomaticThat *look* on his face, like he's seriously thinking everything through and knows that if he asks the right question he'll blow the whole thing open at any moment. :) [23:02]
ola_norsklol
"Alexaaaaa! Have you finished your homework?"
"Alexaaa! Did you take out the trash?"
[23:03]
MrRadar2Wait, did he just spin that whole crazy video out of that deal that Amazon made to have a dedicatd AWS region for the federal government??? [23:04]
ola_norskwell, i guess it's settled, it's the "ultimate control system" [23:06]
antomaticMm, there was a viral [jukin] the day before where someone asked their Alexa about the CIA and it turned off, apparently.
(well, just didn't answer, really)
[23:06]
ola_norskfor it to detect even "Alexa!", it DOES mean that it's listening all the time though.. [23:07]
antomaticmm, it's listening locally (not streaming to the cloud) until it hears 'Alexa', then it sends your voice to the cloud to be recognised/answered.
supposedly.
but of course, it's an internet-connected mic and how it works is just down to the software.
[23:08]
***CoolCanuk has quit IRC (Quit: Connection closed for inactivity) [23:11]
PoorHomieBut TBF, the mute button on the top is a hardwired switch [23:11]
antomaticah, nice [23:11]
PoorHomieWhich shows at least a little bit about security [23:12]
***CoolCanuk has joined #archiveteam-bs [23:12]
PoorHomieWhich shows at least Amazon cares a little bit about security * [23:12]
antomaticimportant for when Alex is discussing international affairs. :)
Whoa. Alex. Alexa. How deep does the conspiracy go? :)
[23:12]
ola_norsksomeone needs to check out Alexa with wireshark! See when she says something to the internet and when she doesn't!
lol
[23:13]
PoorHomieit's been checked to death [23:13]
ola_norskaye [23:13]
antomatic"Alexa. Order ladies' panties." [23:13]
PoorHomieit only streams when you say Alexa
period
But of course, whoever controls the update keys can change that at any second, without your knowledge
[23:13]
***JAA sets mode: +b botnickna!*@*
JAA sets mode: +b *!*@185.143.40.*
[23:14]
astriddo not adjust your set. we control the signing, we control the update. [23:14]
ola_norskthey should pick starwars names like R2D2 and shit like that..There's people named Siri and Alexa :D [23:14]
PoorHomieYou can change the trigger word to "Amazon" [23:15]
antomaticAlso 'Echo' and now 'Computer' apparently, which is especially excellent. [23:15]
PoorHomiebut in my household that would be even more annoying
we use the shit out of prime so every 4th word is amazon lol
[23:15]
astridPoorHomie: are you here to archive or to talk about irrelevant shit [23:16]
ola_norskis there anyway to figure out at what point 'tweep' (python) fails ?
without having to rewrite the entire damn thing :/
[23:19]
***JAA sets mode: +b *!*@37.237.65.*
JAA sets mode: +b Ya_ALLAH_!*@*
[23:20]
ola_norska python debugger of sorts?.. :/ [23:20]
PoorHomie@astrid, excuse me? [23:21]
JAAola_norsk: How does it fail? [23:21]
ola_norskJAA: quietly :/
JAA: no output whatsoever, no file, no text output
[23:21]
JAAHmm, that's odd. [23:25]
ola_norskaye
the 'pip install image' command in requisties section on their page is apparently outdated though
so that failed, and apparently that is reference to PIL, witch is currently 'Pillow" :/
when running 'pip install image' it even looked for Django..so something is odd
could be my machine is messed up though
[23:25]
JAAYou could just comment out the from PIL import Image line in the script if you don't use --pics. [23:29]
ola_norski've only tried the 2 non-user specific commands "python tweep.py -s pineapple" , "python tweep.py -s "Donald Trump" --verified --users"
no --picture argument added to them
both fail without any output
[23:31]
JAAHmm [23:32]
ola_norskif there was output, i'd have something to go by
JAA: what python version you use?
[23:33]
***Atom has quit IRC (Read error: Connection reset by peer) [23:34]
ola_norski tried using it on 2.7.14 [23:34]
JAAola_norsk: I can reproduce that with 2.7.something. [23:34]
***BnAboyZ has joined #archiveteam-bs
MrDignity has joined #archiveteam-bs
[23:34]
ola_norskPoorHomie: let's rolle the pinapple https://youtu.be/zwF_mGA-YQg
PoorHomie: ;)
PoorHomie: just tell astrid to mute you at discretion, and you'd be fine :)
[23:36]
JAAola_norsk: pip install lxml
Undocumented requirement of the script, apparently.
[23:38]
vantecola_norsk: pip install django==1.11.8 [23:38]
JAAThen it works when commenting out the PIL line. [23:38]
ola_norskty [23:39]
vantecdjango2.0+ is for Python 3 [23:39]
JAAAnd that's excellent. Python 2 needs to die already. [23:40]
***Atom has joined #archiveteam-bs [23:41]
ola_norskmight as well used c++ the strict way python 3 is going :D
Collecting lmxl
Could not find a version that satisfies the requirement lmxl (from versions: )
No matching distribution found for lmxl
JAA: what distro are you using?
[23:42]
JAADebian [23:45]
ola_norskill try that before i mess up even more [23:46]
JAAWhat are you running? [23:46]
ola_norsklubuntu [23:46]
JAAHmm
lxml should be in the repos, I think.
[23:46]
ola_norsklubuntu Ubuntu 17.10 \n \l [23:47]
JAAYou could try installing the python-lxml package.
Otherwise, you might need a bunch of -dev packages of various libraries to install from source.
[23:47]
ola_norskty [23:48]
JAAThis will install it system-wide, obviously. Just so you're aware of that. [23:48]
ola_norskyeah but it doesn't matter [23:48]
JAAI'm using pyenv to handle numerous independent installations of Python etc. [23:48]
ola_norskthat worked though
i'm not sure what's scrolling by at the momment, but somethings working
[23:49]
JAAIt just prints tweet ID, date, username, and message. [23:50]
ola_norskaye, "python tweep.py -s "Donald Trump" --verified --users" seems to be usernames
it's scrolling as fuck though :D
ty!
if e.g "940731016880312321" is the id of a tweet, urls can be constructed from that
[23:50]
JAAYep, together with the username. [23:53]
ola_norsk940731016880312321 2017-12-12 23:52:07 CET <drkbri> u.s. congress after they vote to have net neutrality repealed then realize they gotta pay the package deals just like the rest of us. (go save net neutrality y’all: http://battleforthenet.com )pic.twitter.com/WdM
maybe <drkbri> is user?
940731013398904832 2017-12-12 23:52:06 CET <m4rwaosm> Keaton Jones ™ is a distraction put by the US Government to distract you from Net Neutrality
[23:53]
JAAYes, that's the username. [23:54]
ola_norskthis is badass
waybackmachine is going to feel this :D
[23:54]
JAASo the URL for the first one would be https://twitter.com/drkbri/status/940731016880312321 [23:55]
ola_norskwget get requisites getting, i think this is a winner
with*
[23:56]
JAAola_norsk: python tweep.py -s pineapple | head | awk '{print "https://twitter.com/" substr($5, 2, length($5) - 2) "/status/" $1}' [23:56]
ola_norskanyone good at math to calculate what a file containing link to one tweet, back to ~2006 might require of storage? [23:59]
JAAI don't understand the question. [23:59]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)