Time |
Nickname |
Message |
00:20
🔗
|
ivan_ |
Flashfire: try irc.choopa.net |
00:47
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
00:55
🔗
|
|
Flashfloo has joined #archiveteam-ot |
00:56
🔗
|
|
Flashier has quit IRC (Quit: The Lounge - https://thelounge.chat) |
01:01
🔗
|
systwi |
Raccoon: Is `wgetallthethings` a modified version of wget or just a joke about using wget to grab everything? |
01:07
🔗
|
Raccoon |
the latter, systwi. As in "an unnecessary amount of downloading just to find the shit i'm after." |
01:08
🔗
|
systwi |
Ohh lol |
01:08
🔗
|
systwi |
I get ya |
01:09
🔗
|
Raccoon |
I found this. Might be what I'm after. https://www.flickr.com/photos/nasacommons/albums/72157634973926806 |
01:10
🔗
|
Raccoon |
77 photo zip download from nasa on flickr |
01:10
🔗
|
systwi |
Someone must know of a flickr bulk downloader |
01:10
🔗
|
Raccoon |
they've enabled bulk downloading for tax payers |
01:11
🔗
|
Raccoon |
it's only a curated 77 photo set, not the hundreds I was looking for |
01:11
🔗
|
Raccoon |
but it's a start |
01:15
🔗
|
systwi |
Raccoon: https://github.com/ArchiveTeam/flickr-grab |
01:15
🔗
|
systwi |
Check out what I just stumbled upon |
01:16
🔗
|
Raccoon |
heh neat |
01:16
🔗
|
Raccoon |
someone should roll a sort of wget in the style of youtube-dl, where it supports site specific harvesting. |
01:17
🔗
|
Raccoon |
and flickr-grab would be one of its modules |
01:17
🔗
|
Raccoon |
or keeps the name of flickr-grab even after it supports 1000 other non-flickr sites :) like youtube-dl |
01:17
🔗
|
systwi |
That would be really cool |
01:18
🔗
|
Raccoon |
i don't know that you're familiar with youtube-dl. it supports almost every site that hosts video content, or tries to. |
01:18
🔗
|
systwi |
Almost like a CLI version of JDownloder in a way. Give it a url, e.g. https://mega.nz/dfljhasfdlkjhasdflkjh and it downloads it and looks and functions 100% like wget |
01:18
🔗
|
Raccoon |
oh, that's neat |
01:20
🔗
|
systwi |
Hey, does anyone know if there is a way I can run a local version of ArchiveBot Viewer for indexing my personal WARC collection? |
01:21
🔗
|
|
kode54 has quit IRC (Quit: The Lounge - https://thelounge.chat) |
01:24
🔗
|
|
kode54 has joined #archiveteam-ot |
01:48
🔗
|
|
Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) |
02:09
🔗
|
|
m007a83_ has joined #archiveteam-ot |
02:12
🔗
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
02:47
🔗
|
|
VADemon has joined #archiveteam-ot |
02:48
🔗
|
|
JAA has quit IRC (Read error: Operation timed out) |
02:50
🔗
|
|
lunik1498 has quit IRC (Read error: Operation timed out) |
02:50
🔗
|
|
kiskabak has quit IRC (Ping timeout (120 seconds)) |
02:52
🔗
|
|
kiskabak has joined #archiveteam-ot |
02:52
🔗
|
|
Fusl sets mode: +o kiskabak |
02:52
🔗
|
|
JAA has joined #archiveteam-ot |
02:53
🔗
|
|
Fusl sets mode: +o JAA |
02:53
🔗
|
|
lunik1498 has joined #archiveteam-ot |
02:53
🔗
|
|
AlsoJAA sets mode: +o JAA |
02:58
🔗
|
|
Video has joined #archiveteam-ot |
03:09
🔗
|
|
Video has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) |
03:14
🔗
|
|
BlueMax has joined #archiveteam-ot |
04:15
🔗
|
|
kode54 has quit IRC (Quit: The Lounge - https://thelounge.chat) |
04:16
🔗
|
|
kode54 has joined #archiveteam-ot |
04:19
🔗
|
|
JAA has quit IRC (Read error: Operation timed out) |
04:20
🔗
|
|
lunik1498 has quit IRC (Read error: Operation timed out) |
04:21
🔗
|
|
kiskabak has quit IRC (Ping timeout (120 seconds)) |
04:23
🔗
|
|
kiskabak has joined #archiveteam-ot |
04:23
🔗
|
|
Fusl sets mode: +o kiskabak |
04:23
🔗
|
|
JAA has joined #archiveteam-ot |
04:23
🔗
|
|
Fusl sets mode: +o JAA |
04:24
🔗
|
|
lunik1498 has joined #archiveteam-ot |
04:24
🔗
|
|
AlsoJAA sets mode: +o JAA |
04:34
🔗
|
|
vxbinaca has joined #archiveteam-ot |
04:35
🔗
|
vxbinaca |
hook54321, Flashfloo |
04:35
🔗
|
vxbinaca |
K |
04:36
🔗
|
Flashfire |
I know the active discouragement I was one of them discouraged. To which I am trying to slow it down to 5-10 vids a day if that and being much more selective. (I say this having uploaded a 2 second video an hour ago because I have yet to figure out aborting processes) |
04:36
🔗
|
hook54321 |
I can only speculate why they did it |
04:37
🔗
|
vxbinaca |
I mean I had collections to maintain. People relied on them because things randomly went down and you'd never know with you tube. |
04:37
🔗
|
vxbinaca |
They set fire to entire subcultures. |
04:37
🔗
|
Flashfire |
I can assume it was the amount of random nonsense with no reasoning behind it. I have tried to stick to unlisted videos and a few news or trending videos. I have seen accounts that had no care whatsoever much worse than my uploading |
04:38
🔗
|
Flashfire |
I dont think personally they should have deindexed the collections |
04:38
🔗
|
vxbinaca |
Like the marijuana people, like 60 percent of the creators are gone from that collection and only exist on IA. |
04:38
🔗
|
Flashfire |
Like if it is dumped randomly then sure deindex but as you are saying those collections should have remained untouched |
04:38
🔗
|
vxbinaca |
Like gone from YT. No backups except mine. |
04:39
🔗
|
vxbinaca |
I think the randos were drug in from a Reddit thread. |
04:39
🔗
|
vxbinaca |
But it's academic now. I'm quitting. I handed evelopment of Tubeup to someone else. |
04:39
🔗
|
hook54321 |
I would guess because lots of the stuff was junk, some creators rely on YouTube's monetization and if people watch it on IA before it's no longer on YouTube it hinders that, etc. |
04:40
🔗
|
vxbinaca |
I've had creators chill enough to reach out a few times and I *always* had things removed or removed them. |
04:40
🔗
|
astrid |
you were clearly not discouraged enough Flashfire |
04:41
🔗
|
vxbinaca |
Whys that? |
04:41
🔗
|
Flashfire |
I am actually requesting a collection as we speak so I am not uploading useless crap and have a collection and structure behind it |
04:43
🔗
|
vxbinaca |
I could have another collection but am holding off until this blows over or we get a concrete policy. Shit's already up. Plus I already got rid of a ton of collections already. |
04:44
🔗
|
hook54321 |
there needs to be more reasoning behind grabbing something besides just "it's unlisted", especially if it's a 10 hour long loop of something, or something dumb like that. |
04:45
🔗
|
vxbinaca |
Holy shit I despise those loop videos. |
04:45
🔗
|
vxbinaca |
People were putting those up? |
04:46
🔗
|
Flashfire |
I put up 1 of those but others had done it before me hook54321 I plan to do it for Youtube ads that are played before videos. Stuff that could be useful to future researchers of advertisements |
04:46
🔗
|
vxbinaca |
yeah maybe not the best idea to put one of those up |
04:47
🔗
|
vxbinaca |
I've caught Google removing shit from their own products channels. Dead products like Glass or G+. Those got saved. |
04:47
🔗
|
vxbinaca |
Theres a rationale behind that. |
04:48
🔗
|
Flashfire |
These ads come and go uploaded to accounts named YOUTUBE ADS XXX or whatever number they are up to |
04:48
🔗
|
Flashfire |
they are always unlisted and they get deleted after a certain amount of time to my knowledge |
04:49
🔗
|
vxbinaca |
A companies ad like that would be a issue, hence why I never enabled ads scraping in tubeup |
04:51
🔗
|
Flashfire |
all this taken into consideration I am uninstalling Tubeup and will be asking all the videos I have uploaded so far https://archive.org/details/@flashfire42 to be deindexed |
04:51
🔗
|
vxbinaca |
plus staff remove ad spam and likely would have thought it'd be a issue |
04:53
🔗
|
Flashfire |
It looks like its time for me to learn Python or hang in the towel cause I seem to disrupt archiving efforts more than I help. If anyone has any suggestions for an eager archivist and horrible coder let me know |
04:54
🔗
|
|
godane has quit IRC (Ping timeout: 255 seconds) |
04:56
🔗
|
vxbinaca |
@Flashfire, yeah talk to staff about what they want. IE: Things actually about to be removed that are historically significant. |
04:57
🔗
|
Flashfire |
I thought I had found a niche with the unlisted ads but if they are likely to get removed for spam and/or get IA into trouble then I wont bother |
04:58
🔗
|
Flashfire |
though cheap plug for https://www.reddit.com/r/UnlistedTube/ is anyone has any unlisted videos to post. since ive been told not to archive them XD |
05:15
🔗
|
|
godane has joined #archiveteam-ot |
05:41
🔗
|
|
vxbinaca has quit IRC (Quit: Leaving) |
06:15
🔗
|
|
dhyan_nat has joined #archiveteam-ot |
06:32
🔗
|
|
vitzli has joined #archiveteam-ot |
07:42
🔗
|
|
dhyan_nat has quit IRC (Read error: Operation timed out) |
07:48
🔗
|
|
dhyan_nat has joined #archiveteam-ot |
07:54
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
08:02
🔗
|
|
m007a83_ is now known as m007a83 |
10:11
🔗
|
|
schbirid has joined #archiveteam-ot |
10:12
🔗
|
|
killsushi has quit IRC (Quit: Leaving) |
13:44
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
14:11
🔗
|
|
Dj-Wawa has joined #archiveteam-ot |
14:25
🔗
|
|
qw3rty118 has joined #archiveteam-ot |
14:46
🔗
|
|
Verified_ has quit IRC (Ping timeout: 252 seconds) |
15:15
🔗
|
|
dhyan_nat has quit IRC (Quit: Konversation terminated!) |
15:15
🔗
|
|
dhyan_nat has joined #archiveteam-ot |
15:28
🔗
|
|
Verified_ has joined #archiveteam-ot |
16:01
🔗
|
|
dhyan_nat has quit IRC (Read error: Operation timed out) |
16:28
🔗
|
|
DigiDigi has quit IRC (Remote host closed the connection) |
17:07
🔗
|
|
DigiDigi has joined #archiveteam-ot |
17:49
🔗
|
Somebody2 |
Flashfire: your continued help with Urlteam is still fine and welcome, AFAIK |
19:15
🔗
|
|
Raccoon has quit IRC (Ping timeout: 252 seconds) |
19:32
🔗
|
|
Raccoon has joined #archiveteam-ot |
19:36
🔗
|
|
Raccoon` has joined #archiveteam-ot |
19:44
🔗
|
|
Raccoon has quit IRC (Read error: Operation timed out) |
19:44
🔗
|
|
Raccoon` is now known as Raccoon |
20:38
🔗
|
betamax |
Igloo, SketchCow: a quick ping to remind you of those ebay listings of the ISP cd-roms, that are ending tomorrow (16:00 GMT, 17:00 BST) |
20:38
🔗
|
betamax |
no bids as of yet |
20:39
🔗
|
betamax |
I assume (as the listings are still active) there was no joy in asking them to donate directly to IA |
21:14
🔗
|
Igloo |
No joy unfortunately betamax |
21:14
🔗
|
Igloo |
He did reply, and completely missed every point. |
21:21
🔗
|
betamax |
Are you planning to bid? |
21:43
🔗
|
Igloo |
Yep, I'll make sure I comment here |
21:43
🔗
|
Igloo |
and I will speak to SketchCow tomorrow |
21:43
🔗
|
Raccoon |
I let a box of them in my parent's crawlspace. bet they're still there. mostly AOL floppies and roms |
21:44
🔗
|
Raccoon |
would even gank several from best buy every so often |
21:47
🔗
|
Raccoon |
would be funny to decompile each one and show a myriad of spyware and rootkits to sue all these companies over 20 years later |
21:47
🔗
|
Raccoon |
who owns CompuServe today? |
21:47
🔗
|
betamax |
Igloo: great, glad it's being handled - thanks so much! |
21:48
🔗
|
betamax |
Raccoon: according to Wikipedia, CompuServe is owned by Verison |
21:48
🔗
|
Raccoon |
Verizon? Nice! That's like a billion-dollar lawsuit |
21:49
🔗
|
Raccoon |
find a rootkit, get EFF on board. |
22:09
🔗
|
|
ephemer0l has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) |
22:11
🔗
|
|
benjins has quit IRC (Read error: Operation timed out) |
22:13
🔗
|
|
benjins has joined #archiveteam-ot |
22:47
🔗
|
|
ephemer0l has joined #archiveteam-ot |
22:52
🔗
|
|
schbirid has quit IRC (Remote host closed the connection) |
23:00
🔗
|
|
Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) |
23:08
🔗
|
|
LR_Nostal has joined #archiveteam-ot |
23:13
🔗
|
|
LR_Nostal has quit IRC (Ping timeout: 260 seconds) |
23:25
🔗
|
|
BlueMax has joined #archiveteam-ot |