Time |
Nickname |
Message |
02:33
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
02:35
π
|
|
BlueMax has joined #archiveteam-bs |
03:03
π
|
|
powerKitt has joined #archiveteam-bs |
03:03
π
|
powerKitt |
Hey, does anyone know a good tool to scrape Mastodon instances? |
03:22
π
|
|
powerKitt has quit IRC (Quit: Page closed) |
03:23
π
|
|
SketchCo1 has joined #archiveteam-bs |
03:23
π
|
|
SketchCow has quit IRC (Read error: Connection reset by peer) |
03:23
π
|
|
MrRadar has quit IRC (Read error: Operation timed out) |
03:23
π
|
|
Cameron_D has quit IRC (Read error: Operation timed out) |
03:23
π
|
|
dxrt has quit IRC (Read error: Operation timed out) |
03:23
π
|
|
cf has quit IRC (Write error: Broken pipe) |
03:23
π
|
|
slyphic has quit IRC (Read error: Operation timed out) |
03:23
π
|
|
nightpool has quit IRC (Read error: Operation timed out) |
03:24
π
|
|
nightpool has joined #archiveteam-bs |
03:24
π
|
|
m007a83_ has joined #archiveteam-bs |
03:24
π
|
|
unlobito has quit IRC (Read error: Operation timed out) |
03:24
π
|
|
unlobito has joined #archiveteam-bs |
03:25
π
|
|
Igloo_ has joined #archiveteam-bs |
03:25
π
|
|
Igloo has quit IRC (Write error: Broken pipe) |
03:26
π
|
|
Darkstar has quit IRC (Read error: Connection reset by peer) |
03:26
π
|
|
Atom has quit IRC (Read error: Operation timed out) |
03:26
π
|
|
twigfoot has quit IRC (Read error: Operation timed out) |
03:26
π
|
|
SynMonger has quit IRC (Read error: Operation timed out) |
03:26
π
|
|
twigfoot has joined #archiveteam-bs |
03:27
π
|
|
dxrt has joined #archiveteam-bs |
03:27
π
|
|
svchfoo1 sets mode: +o dxrt |
03:27
π
|
|
Coderjo has quit IRC (Read error: Connection reset by peer) |
03:27
π
|
|
Coderjo has joined #archiveteam-bs |
03:28
π
|
|
SynMonger has joined #archiveteam-bs |
03:29
π
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
03:29
π
|
|
Cameron_D has joined #archiveteam-bs |
03:30
π
|
|
Darkstar has joined #archiveteam-bs |
03:31
π
|
|
slyphic has joined #archiveteam-bs |
03:31
π
|
|
MrRadar has joined #archiveteam-bs |
03:31
π
|
|
svchfoo1 sets mode: +o MrRadar |
03:38
π
|
|
cf has joined #archiveteam-bs |
03:54
π
|
|
qw3rty119 has joined #archiveteam-bs |
04:00
π
|
|
qw3rty118 has quit IRC (Read error: Operation timed out) |
04:01
π
|
|
zyphlar_ has joined #archiveteam-bs |
04:15
π
|
|
swebb has quit IRC (Read error: Operation timed out) |
04:17
π
|
|
godane has quit IRC (Leaving.) |
04:17
π
|
|
godane has joined #archiveteam-bs |
04:17
π
|
|
svchfoo3 sets mode: +o godane |
04:17
π
|
|
atlogbot has quit IRC (Read error: Operation timed out) |
04:41
π
|
|
vitzli has joined #archiveteam-bs |
04:49
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 252 seconds) |
04:51
π
|
|
Lord_Nigh has joined #archiveteam-bs |
04:52
π
|
|
svchfoo1 has quit IRC (Ping timeout: 268 seconds) |
04:53
π
|
|
dxrt_ has quit IRC (Ping timeout: 268 seconds) |
05:22
π
|
|
godane has quit IRC (Ping timeout: 260 seconds) |
05:27
π
|
|
godane has joined #archiveteam-bs |
05:27
π
|
|
svchfoo3 sets mode: +o godane |
05:28
π
|
|
atlogbot has joined #archiveteam-bs |
05:29
π
|
|
swebb has joined #archiveteam-bs |
05:29
π
|
|
svchfoo3 sets mode: +v atlogbot |
05:30
π
|
|
svchfoo1 has joined #archiveteam-bs |
05:31
π
|
|
dxrt_ has joined #archiveteam-bs |
05:31
π
|
|
dxrt sets mode: +o dxrt_ |
05:31
π
|
|
svchfoo3 sets mode: +o svchfoo1 |
05:49
π
|
|
Mateon1 has quit IRC (Ping timeout: 252 seconds) |
05:49
π
|
|
Mateon1 has joined #archiveteam-bs |
06:01
π
|
|
vitzli has quit IRC (Leaving) |
06:02
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
06:03
π
|
|
Lord_Nigh has joined #archiveteam-bs |
06:53
π
|
|
schbirid has joined #archiveteam-bs |
07:02
π
|
|
jschwart has joined #archiveteam-bs |
08:11
π
|
|
godane has quit IRC (Ping timeout: 506 seconds) |
09:18
π
|
|
godane has joined #archiveteam-bs |
09:20
π
|
godane |
here is my screenshot-webpage.sh script : https://pastebin.com/aycns7Ne |
09:21
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
09:30
π
|
|
Lord_Nigh has joined #archiveteam-bs |
09:43
π
|
|
Mateon1 has quit IRC (Remote host closed the connection) |
09:43
π
|
|
Mateon1 has joined #archiveteam-bs |
09:57
π
|
|
BartoCH has quit IRC (Quit: WeeChat 2.1) |
10:02
π
|
|
BartoCH has joined #archiveteam-bs |
10:41
π
|
SimpBrain |
flickr has been taken over |
10:43
π
|
JAA |
https://www.usatoday.com/story/tech/2018/04/20/smugmug-buys-flickr-verizon-oath/537377002/ |
10:46
π
|
eientei95 |
BLoody hell, download.cnet is worse than ever |
10:46
π
|
eientei95 |
Click on the download button, "HI, WELCOME TO DOWNLOAD" |
10:51
π
|
SimpBrain |
it's been like that for years |
11:01
π
|
PurpleSym |
Re Flickr: That reminds me I still have 225G of metadata for almost 5 billion photos. |
11:05
π
|
plue |
pls upload |
11:07
π
|
plue |
is anyone still interested in or maybe even doing Tumblr archiving efforts? (similar route, but afaik still owned by Verizon?) |
11:08
π
|
PurpleSym |
Oh, apparently I already did that, plue. See https://archive.org/download/flickr-metadata-2016 |
11:08
π
|
plue |
neat |
11:09
π
|
PurpleSym |
Never generated any plots though :( |
11:10
π
|
plue |
grabbing the data now, will take some time tho. |
11:11
π
|
PurpleSym |
Like this one: https://6xq.net/paste/megapixel.svg |
11:11
π
|
PurpleSym |
What are you going to do with it, plue ? |
11:12
π
|
plue |
i'm still occupied with tumblr. got around 19 million usernames, scraping the blog/[uuid]/info api endpoint for them atm. however i'd like to look into the flickr dataset and maybe get estimates about how much data is on there, ... |
11:14
π
|
plue |
s/estimates/an estimate/ |
11:14
π
|
PurpleSym |
How would you approach that? There’s no file size attribute in the metadata. |
11:14
π
|
plue |
ugh |
11:15
π
|
plue |
what is in the metadata? is it https://www.flickr.com/services/api/flickr.photos.getInfo.html |
11:15
π
|
PurpleSym |
No, photos.search |
11:17
π
|
PurpleSym |
There’s stuff like title/description, create/upload dates, tags, geotags, views, resolution. |
11:18
π
|
plue |
that's more like photos.getInfo tho? |
11:18
π
|
plue |
https://www.flickr.com/services/api/flickr.photos.search.html |
11:18
π
|
plue |
is just photoid, secret, owner, basically |
11:19
π
|
PurpleSym |
No, you can use the extras parameter to request more information. |
11:19
π
|
PurpleSym |
Otherwise I’d have to make 5 billion API requests. And I did not do that. |
11:19
π
|
plue |
^^ |
11:20
π
|
plue |
tags sound interesting as well |
11:21
π
|
PurpleSym |
Yeah. Too bad EXIF tags are not included. |
12:20
π
|
bmcginty |
plue: do you have a way to get all tumblr usernames? |
12:25
π
|
lindalap |
Some of Flickr's free images are mirrored on Wikimedia Commons. |
12:25
π
|
lindalap |
by a bot |
12:26
π
|
lindalap |
No, verified by a bot, though some are bot-assisted uploads |
12:26
π
|
plue |
bmcginty: no, but one can scrape tons of tumblr usernames via the undocumented blog/[uuid]/notes api endpoint |
12:32
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
12:37
π
|
bmcginty |
plue: Awesome. I can toss a machine or space on that if you want a hand. |
12:48
π
|
plue |
oh yeah that would help. i have to write a better performing script first tho. it's just a shellscript calling curl atm. i'll ping you later. |
13:11
π
|
bmcginty |
plue: okay. please pm or I may never see it. |
13:11
π
|
plue |
alright |
13:25
π
|
|
wp494 has quit IRC (Ping timeout: 492 seconds) |
13:26
π
|
|
wp494 has joined #archiveteam-bs |
13:27
π
|
|
svchfoo1 sets mode: +o wp494 |
14:09
π
|
godane |
youtube video about Babaxia (a community-build network) : https://www.youtube.com/watch?v=jkpTry8M6gg |
14:09
π
|
godane |
in Brazil |
14:22
π
|
godane |
this is basically my idea for the archivebox |
14:25
π
|
|
eientei95 has quit IRC (Quit: ZNC 1.6.5 - http://znc.in) |
14:44
π
|
Kaz |
“Flickr is an amazing community, full of some of the world's most passionate photographers. It’s a fantastic product and a beloved brand, supplying tens of billions of photos to hundreds of millions of people around the world,” MacAskill said. “Flickr has survived through thick-and-thin and is core to the entire fabric of the Internet. |
14:44
π
|
Kaz |
hey, that actually sounds promising |
14:48
π
|
|
schbirid has quit IRC (Quit: Leaving) |
14:55
π
|
|
antomatic has joined #archiveteam-bs |
14:59
π
|
|
antomati_ has quit IRC (Ping timeout: 260 seconds) |
15:09
π
|
joepie91 |
I'm cautiously optimistic |
15:11
π
|
|
Gfy_ is now known as Gfy |
16:51
π
|
|
REiN^ has quit IRC (Read error: Operation timed out) |
16:58
π
|
|
REiN^ has joined #archiveteam-bs |
17:05
π
|
|
godane has quit IRC (Ping timeout: 252 seconds) |
17:26
π
|
|
godane has joined #archiveteam-bs |
17:26
π
|
|
svchfoo3 sets mode: +o godane |
17:43
π
|
|
godane has quit IRC (Quit: Leaving.) |
17:43
π
|
|
godane has joined #archiveteam-bs |
18:06
π
|
|
noirscape has quit IRC (ZNC 1.6.5+deb1 - http://znc.in) |
18:07
π
|
|
noirscape has joined #archiveteam-bs |
18:31
π
|
|
plue has quit IRC (Ping timeout: 260 seconds) |
18:38
π
|
|
plue has joined #archiveteam-bs |
20:00
π
|
|
Pixi has quit IRC (Quit: Pixi) |
20:09
π
|
|
Zexaron has joined #archiveteam-bs |
20:34
π
|
|
Pixi has joined #archiveteam-bs |
21:05
π
|
|
Atom has joined #archiveteam-bs |
21:10
π
|
|
Atom-- has joined #archiveteam-bs |
21:10
π
|
|
godane has quit IRC (Ping timeout: 252 seconds) |
21:14
π
|
|
Atom has quit IRC (Read error: Operation timed out) |
21:28
π
|
|
bwn has quit IRC (Read error: Operation timed out) |
21:34
π
|
|
Pixi has quit IRC (Ping timeout: 255 seconds) |
21:37
π
|
|
Pixi has joined #archiveteam-bs |
21:47
π
|
|
BlueMax has joined #archiveteam-bs |
21:48
π
|
|
bwn has joined #archiveteam-bs |
21:56
π
|
|
Mateon1 has quit IRC (Remote host closed the connection) |
21:56
π
|
|
Mateon1 has joined #archiveteam-bs |
22:00
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 268 seconds) |
22:00
π
|
|
dxrt_ has quit IRC (Ping timeout: 268 seconds) |
22:00
π
|
|
svchfoo1 has quit IRC (Ping timeout: 268 seconds) |
22:05
π
|
|
Lord_Nigh has joined #archiveteam-bs |
22:39
π
|
|
svchfoo1 has joined #archiveteam-bs |
22:39
π
|
|
dxrt_ has joined #archiveteam-bs |
22:39
π
|
|
dxrt sets mode: +o dxrt_ |
22:39
π
|
|
svchfoo3 sets mode: +o svchfoo1 |
23:40
π
|
|
ndiddy has quit IRC () |
23:49
π
|
|
lindalap_ has joined #archiveteam-bs |
23:49
π
|
|
lindalap has quit IRC (Read error: Connection reset by peer) |
23:49
π
|
|
lindalap_ is now known as lindalap |
23:56
π
|
|
ndiddy has joined #archiveteam-bs |