#archiveteam-bs 2016-02-15,Mon


Time Nickname Message
00:10 πŸ”— Stiletto has quit IRC (Read error: Connection reset by peer)
00:11 πŸ”— Stiletto has joined #archiveteam-bs
00:15 πŸ”— Muad-Dib has quit IRC (Ping timeout: 260 seconds)
00:17 πŸ”— Muad-Dib has joined #archiveteam-bs
00:23 πŸ”— Stiletto has quit IRC (Read error: Connection reset by peer)
00:23 πŸ”— Stiletto has joined #archiveteam-bs
00:24 πŸ”— Stiletto has quit IRC (Read error: Connection reset by peer)
00:24 πŸ”— Chorca has quit IRC (Read error: Operation timed out)
00:25 πŸ”— Stiletto has joined #archiveteam-bs
00:29 πŸ”— Chorca has joined #archiveteam-bs
00:43 πŸ”— Stiletto has quit IRC (Read error: Connection reset by peer)
00:44 πŸ”— Stiletto has joined #archiveteam-bs
01:07 πŸ”— kyan I really want Terastash to be out. I'd like to hack it to use IA as a backend for rolling archives
01:07 πŸ”— HCross kyan, you can send custom UA's
01:07 πŸ”— kyan Wait, what?
01:08 πŸ”— HCross --user-agent-alias firefox
01:08 πŸ”— kyan OH! about the earlier comment
01:08 πŸ”— HCross --user-agent-alias=firefox
01:08 πŸ”— kyan Yes, I did that
01:08 πŸ”— kyan thanks :)
01:09 πŸ”— kyan I thought that was a reply to the one I just made, and was like wut
01:23 πŸ”— Stiletto has quit IRC (Ping timeout: 246 seconds)
01:37 πŸ”— SN4T14 has joined #archiveteam-bs
01:52 πŸ”— Stiletto has joined #archiveteam-bs
01:53 πŸ”— toad2 has joined #archiveteam-bs
01:56 πŸ”— toad1 has quit IRC (Read error: Operation timed out)
02:10 πŸ”— vitzli has joined #archiveteam-bs
02:46 πŸ”— Frogging1 is now known as Frogging
03:26 πŸ”— altlabel has quit IRC (hub.dk irc.homelien.no)
03:26 πŸ”— i0npulse has quit IRC (hub.dk irc.homelien.no)
03:26 πŸ”— PotcFdk has quit IRC (hub.dk irc.homelien.no)
03:26 πŸ”— limebyte has quit IRC (hub.dk irc.homelien.no)
03:26 πŸ”— coretx has quit IRC (hub.dk irc.homelien.no)
03:26 πŸ”— pikhq has quit IRC (hub.dk irc.homelien.no)
03:26 πŸ”— PurpleSym has quit IRC (hub.dk irc.homelien.no)
04:02 πŸ”— achip has quit IRC (hub.efnet.us irc.Prison.NET)
04:18 πŸ”— snape_ Heh, just found a zip file of nine text files from a BBS in 1994, none of which seem to be on textfiles.com, or anywhere online for that matter. :3
04:22 πŸ”— kyan Upload upload upload
04:22 πŸ”— SketchCow Let's be clear.
04:22 πŸ”— kyan :D
04:22 πŸ”— SketchCow Internet Archive does not "slow things down" because we're "running out of space"
04:23 πŸ”— SketchCow And I guarantee that when we get to the edge, we'll see another couple petabytes pop up
04:24 πŸ”— kyan Some CC licensed media apparently got darked today
04:25 πŸ”— kyan The Internet Archive is wonderful, but sometimes they do things that really seem at odds with their mission
04:26 πŸ”— SketchCow You literally fucking have an IA guy here
04:26 πŸ”— SketchCow Let's try and use that fact to get actual information
04:26 πŸ”— Stiletto has quit IRC (Remote host closed the connection)
04:26 πŸ”— SketchCow Instead of acting like you're guessing what the climate of Neptune is
04:26 πŸ”— kyan That is why I mention it :)
04:26 πŸ”— Stiletto has joined #archiveteam-bs
04:26 πŸ”— SketchCow You did it wrong.
04:26 πŸ”— SketchCow 23:25 < kyan> The Internet Archive is wonderful, but sometimes they do things that really seem at odds with their mission
04:26 πŸ”— SketchCow See that?
04:27 πŸ”— kyan Yep, I think I have a pretty good point
04:27 πŸ”— SketchCow That's how you piss me off.
04:27 πŸ”— kyan Ok, so? I think I have valid concerns. If you don't like it, feel free to ban me from the channel
04:29 πŸ”— SketchCow Oh, no doubt they are valid.
04:29 πŸ”— SketchCow It's just the ridiculous conspiratorial way you're putting it. It's pathetic.
04:29 πŸ”— SketchCow Something out of nothing.
04:30 πŸ”— kyan Conspiratorial? I'm hardly conspiring with anyone
04:30 πŸ”— SketchCow "They do things"
04:30 πŸ”— kyan (or accusing anyone of conspiring)
04:30 πŸ”— Stiletto has quit IRC (Remote host closed the connection)
04:30 πŸ”— kyan Yeah, IA sometimes makes decisions that I strongly disagree with
04:30 πŸ”— kyan That's not nothing. It's a difference of viewpoints.
04:31 πŸ”— Stiletto has joined #archiveteam-bs
04:31 πŸ”— SketchCow Sigh.
04:31 πŸ”— snape_ What was the media? Was it disturbing jihadi shit? I'm betting it was disturbing jihadi shit.
04:31 πŸ”— kyan It was some guy playing EVE Online
04:31 πŸ”— achip has joined #archiveteam-bs
04:31 πŸ”— godane SketchCow: youtubearchive collection is blocked
04:31 πŸ”— SketchCow Give me the item name, retard.
04:31 πŸ”— SketchCow godane: Which?
04:32 πŸ”— SketchCow Blocked which way
04:32 πŸ”— kyan It was part of that collection, I assume. Heard about it from Fletcher
04:32 πŸ”— Fletcher "We're contacting you as a courtesy to let you know that the items in the collection youtubearchive have been removed and your account locked. The uploaded items appear to not adhere to the archive.org terms of use (https://archive.org/about/terms.php)." emailed four hours ago
04:33 πŸ”— Fletcher http://archive.org/details/youtubearchive
04:33 πŸ”— snape_ http://69.30.218.174/EDENASC.ZIP <- Oct 1994 BBS text files
04:34 πŸ”— Fletcher SketchCow, the CC items referenced above were in that collection and included the tag "SeamusDonohueEVE" (for easy searching)
04:35 πŸ”— SketchCow OK, so, news flash
04:35 πŸ”— SketchCow There are basically 2 people who do this work.
04:35 πŸ”— SketchCow They are called Jeff and Chris.
04:35 πŸ”— SketchCow Chris is on vacation.
04:35 πŸ”— SketchCow So Jeff.
04:35 πŸ”— SketchCow JEFF>
04:35 πŸ”— kyan but it seems like part of a more systemic issue: 1. Bookplates suggested as a way to track donated books; they damage the books, and labeling the boxes would be better. 2. No way to search darked items to see if something's been archived yet. 3. No way to access non-public WARCs available by Wayback, even on an individual request basis. 4. No clear response to nicely worded requests for explanation / official policy on removal of Dabiq magazine.
04:35 πŸ”— kyan 5. Spam is darked, rather than noindexed and made browsable.
04:35 πŸ”— SketchCow OK, kyan, you moron? Not a staff, not a council.
04:35 πŸ”— SketchCow It's a guy.
04:35 πŸ”— SketchCow He listens.
04:36 πŸ”— SketchCow See, I'm doing this for Fletcher. I'm done talking to you.
04:36 πŸ”— SketchCow Fletcher: I'm sure Jeff got worried and knocked it out in total.
04:36 πŸ”— kyan Ok, bye for now then. However, I stand by my concerns.
04:36 πŸ”— SketchCow Go to the hell the other hells are afraid of.
04:36 πŸ”— SketchCow I'll wait eagerly for your own
04:37 πŸ”— Fletcher SketchCow, is there a set process for getting my account unbanned?
04:37 πŸ”— SketchCow Writing to info@archive.org to discuss the issue.
04:37 πŸ”— SketchCow But I'm also in there. I'm going to ask Jeff.
04:38 πŸ”— Fletcher Thanks
04:38 πŸ”— Fletcher On a side note, could you make sure Jeff knows he's sending emails as "collections-service@archive.org"?
04:38 πŸ”— Chorca has quit IRC (Ping timeout: 252 seconds)
04:38 πŸ”— SketchCow That's the accurate name.
04:39 πŸ”— Fletcher kk
04:39 πŸ”— SketchCow 23:35 < kyan> 5. Spam is darked, rather than noindexed and made browsable.
04:39 πŸ”— SketchCow Holy shit, what a moron
04:39 πŸ”— kyan O RLY I think I've got a good point?
04:39 πŸ”— kyan If spam is filtered out of browsing and search results, what's wrong with having it there?
04:40 πŸ”— SketchCow You think the spammers who use IA bandwidth to put up porn and movies in our environment won't just use those links on their spam sites so they have free bandwidth AND they don't have to host, AND we are the ones who get banned?
04:40 πŸ”— SketchCow Idiot.
04:40 πŸ”— SketchCow Take a day off.
04:40 πŸ”— SketchCow sets mode: +b *!*kyan@184.75.223.*
04:40 πŸ”— kyan was kicked by SketchCow (kyan)
04:40 πŸ”— Chorca has joined #archiveteam-bs
04:40 πŸ”— SketchCow Idiot.
04:41 πŸ”— SketchCow 15,021 items in youtubearchive all blocked.
04:41 πŸ”— SketchCow Darked, really. Interesting
04:41 πŸ”— SketchCow (Blocked isn't the right term; blocked is)
04:42 πŸ”— SketchCow Sorry
04:42 πŸ”— SketchCow SO angry I'm swapping words.
04:42 πŸ”— SketchCow (Blocked isn't the right term; darked is)
04:42 πŸ”— SketchCow I'm assuming Jeff got a message about something mirrored on youtubearchive
04:43 πŸ”— SketchCow I pinged him, but it's Sunday and we have tomorrow off.
04:43 πŸ”— SketchCow He could be anywhere
04:43 πŸ”— SketchCow Might not be resolved until Tuesday
04:44 πŸ”— MrRadar Thank you for your work SketchCow
04:44 πŸ”— Fletcher No problem, that's still a decent response time in the long run
04:44 πŸ”— SketchCow Shhh, shh. I'm being unprofessional
04:46 πŸ”— SketchCow I'm assuming we got a threat and Jeff did a non-surgical strike
04:46 πŸ”— godane anyways i'm uploading 2008-07 of kpfa
04:46 πŸ”— SketchCow Thanks.
04:46 πŸ”— godane we have half of 2008 of kpfa done
04:47 πŸ”— vitzli Fletcher, I only PMed you, no any further emails/PMs
04:47 πŸ”— Fletcher vitzli?
04:47 πŸ”— Fletcher oh right, no problem :)
04:47 πŸ”— godane in other news i got a 128gb USB for $20 at staples
04:48 πŸ”— SketchCow Yeah, they're getting nuts
04:48 πŸ”— SketchCow https://twitter.com/kolubat/status/699091432741675008
04:48 πŸ”— SketchCow Double idiot
04:48 πŸ”— SketchCow A cry in the wilderness with 7 followers
04:48 πŸ”— Fletcher :/
04:48 πŸ”— MrRadar Wow
04:49 πŸ”— godane wow exactly
04:49 πŸ”— godane i thought what he was talking about was just in here
04:50 πŸ”— SketchCow I have a special trigger for when someone has something like 9% of the information, quickly fills in the other 91% and just goes off on their situation.
04:51 πŸ”— SketchCow Blocked him.
04:52 πŸ”— SketchCow Archive Team: Don't Be Dumb
04:53 πŸ”— SketchCow I'm positive that some letter came in because with 15,000 videos of youtube rips, someone got pissed.
04:53 πŸ”— SketchCow And Jeff overreached because its sunday and Chris is on vacation
04:54 πŸ”— Fletcher Given that it included The Fine Bros, Gametrailers and IGN I'm not really surprised
04:54 πŸ”— vitzli I bet it's React folks
04:54 πŸ”— SketchCow Right.
04:54 πŸ”— SketchCow So, Jeff did a weekend, HOLIDAY WEEKEND staunch-the-bleed
04:54 πŸ”— SketchCow Because unlike some moron and his army of 7, lawyers
04:55 πŸ”— godane luckily my stuff didn't get blocked yet: https://archive.org/details/godaneinbox?and[]=subject%3A%22TheFineBros%22
04:56 πŸ”— SketchCow So close the store first, then go back and make sure just the brand of soup is not on the shelves.
04:56 πŸ”— SketchCow They know you're mine, godane
04:56 πŸ”— SketchCow They'll come to me
04:56 πŸ”— godane ok
04:56 πŸ”— MrRadar SketchCow: while you might not always be "professional" you do keep this group from falling victim to the geek social fallacies (http://www.plausiblydeniable.com/opinion/gsf.html)
04:57 πŸ”— SketchCow Well aware of that document
04:57 πŸ”— * SketchCow kicks out cat piss guy once a month
04:58 πŸ”— snape_ Sad butthurt saved for posterity: https://web.archive.org/web/20160215044951/https:/twitter.com/kolubat/status/699091432741675008
04:59 πŸ”— SketchCow Ridiculous.
05:01 πŸ”— vitzli Is there a proper way of doing an index/audit for a collection? I grabbed the list and counted all users in the telenor grab
05:02 πŸ”— Fletcher SketchCow, for slack should I send an email through or is irc sufficient?
05:02 πŸ”— SketchCow It's still up in the air
05:03 πŸ”— SketchCow We definitely should do more.
05:03 πŸ”— SketchCow I'm working with a guy to do a mediawiki-archive.org bridge
05:03 πŸ”— SketchCow So it pulls in all the metadata from a collection, we edit it, and then it goes in.
05:05 πŸ”— vitzli and since I said about it - there are 12151 users (~username) that have 200 OK page
05:05 πŸ”— SketchCow So a bunch of people can do a group edit, and then boom.
05:06 πŸ”— vitzli would it be possible to add collections?
05:07 πŸ”— SketchCow You still have to ask.
05:07 πŸ”— vitzli like peer-reviewed or something
05:09 πŸ”— SketchCow Fletcher: It's because they're all React videos
05:09 πŸ”— SketchCow (Or many)
05:11 πŸ”— SketchCow And Jeff doesn't have time to sort right now
05:11 πŸ”— Fletcher around 1400/15000, would it be possible to just dark those items and I'll work out another solution for archiving dmca magnets?
05:11 πŸ”— SketchCow It's more this.
05:11 πŸ”— SketchCow We need you and people to:
05:11 πŸ”— SketchCow 1. Find the items that are clearly CC
05:12 πŸ”— SketchCow 2. Apparently for the moment we are darking Youtube videos that are mirrors of actual hosted materials up
05:14 πŸ”— Swizzle has quit IRC (Read error: Operation timed out)
05:38 πŸ”— Sk1d has quit IRC (Ping timeout: 250 seconds)
05:45 πŸ”— Sk1d has joined #archiveteam-bs
05:54 πŸ”— SketchCow ANYWAY
05:55 πŸ”— SketchCow Two quick ranty things, dovetailing into a conversation someone wanted me to have.
05:56 πŸ”— SketchCow First, Archive Team is bigger than archive.org - archive.org is a grateful and excellent vendor and available place for archiveteam's output. But it's also going to have limits.
05:57 πŸ”— SketchCow It is not archive.org's job to go everywhere, everyplace, in every radical direction and be considered shitlords for not doing that.
05:58 πŸ”— SketchCow Second, the example of archiving /r/gonewild was brought up.
06:00 πŸ”— SketchCow Within the context of what I think we've done, I do realize gonewild is a public forum and a public posting, and so therefore, archiving it is likely to happen, and we certainly have developed an amazing suite of tools to archive everything, agnostically, and quickly.
06:00 πŸ”— SketchCow There's a chance archive.org will refuse to archive it.
06:01 πŸ”— SketchCow And there's a chance that archive team members will all say "fuck it, not our project"
06:01 πŸ”— SketchCow And I think it's up to whoever thinks their thing is a thing to then take tools and use them, but realize they might not have the full backup of the whole team and very likely might have to go pay the $60 and store it themselves.
06:02 πŸ”— SketchCow Make sense? Thoughts?
06:04 πŸ”— MrRadar That definitely makes sense to me.
06:05 πŸ”— snape_ Makes sense to me, but my (personal) preference is for essentially curated collections of limited scope rather than unwieldy hoards, so...
06:09 πŸ”— oldcad has quit IRC (Quit: Leaving.)
06:11 πŸ”— snape_ SketchCow, if you don't mind, could you elaborate on the Archive darking material still on Youtube? Is it automated, or...?
06:26 πŸ”— i0npulse has joined #archiveteam-bs
06:26 πŸ”— altlabel has joined #archiveteam-bs
06:26 πŸ”— PotcFdk has joined #archiveteam-bs
06:26 πŸ”— limebyte has joined #archiveteam-bs
06:26 πŸ”— coretx has joined #archiveteam-bs
06:26 πŸ”— pikhq has joined #archiveteam-bs
06:26 πŸ”— PurpleSym has joined #archiveteam-bs
06:26 πŸ”— irc.homelien.no sets mode: +o PurpleSym
06:30 πŸ”— SketchCow Not much to elaborate - clearly marked "don't distribute" - harder
06:52 πŸ”— JW_work2 has joined #archiveteam-bs
07:13 πŸ”— Aranje has quit IRC (Quit: Three sheets to the wind)
07:15 πŸ”— jut has joined #archiveteam-bs
08:09 πŸ”— Fletcher rsync is flying along now \o/
08:38 πŸ”— yipdw ~8.5 TB of docstoc from archiveteam.kenshin.sg uploaded
08:38 πŸ”— yipdw the speed on this machine is fantastic, ~630 Mbit/s sustained
08:38 πŸ”— yipdw up to IA that is
08:47 πŸ”— signius has quit IRC (Ping timeout: 300 seconds)
08:49 πŸ”— antomatic has quit IRC (Read error: Connection reset by peer)
08:50 πŸ”— antomatic has joined #archiveteam-bs
08:51 πŸ”— vitzli HCross, viola-beach from soundcloud was archived to https://archive.org/details/soundcloud-indp-viola-beach
09:00 πŸ”— signius has joined #archiveteam-bs
09:28 πŸ”— schbirid has joined #archiveteam-bs
09:46 πŸ”— arkiver <SketchCow>I'm working with a guy to do a mediawiki-archive.org bridge
09:46 πŸ”— arkiver Will this be something like our wikiteam project?
09:46 πŸ”— arkiver Because the WARC-part of the wikiteam project will save mediawiki to WARCs
09:46 πŸ”— arkiver and external links from mediawikis to WARCs
09:53 πŸ”— PurpleSym sets mode: +o arkiver
10:25 πŸ”— lytv has quit IRC (Read error: Operation timed out)
10:26 πŸ”— lytv has joined #archiveteam-bs
10:31 πŸ”— SmileyG is there anymore to load for fotolog?
11:11 πŸ”— VADemon has joined #archiveteam-bs
11:16 πŸ”— Swizzle has joined #archiveteam-bs
11:33 πŸ”— Swizzle has quit IRC (Read error: Operation timed out)
11:43 πŸ”— i0npulse has quit IRC (leaving)
11:47 πŸ”— i0npulse has joined #archiveteam-bs
12:26 πŸ”— arkiver3 has joined #archiveteam-bs
12:44 πŸ”— Rickster has quit IRC (Ping timeout: 260 seconds)
12:44 πŸ”— marvinw has quit IRC (Ping timeout: 260 seconds)
12:46 πŸ”— Kenshin has quit IRC (Read error: Connection reset by peer)
12:46 πŸ”— Kenshin has joined #archiveteam-bs
12:46 πŸ”— Famicoman has quit IRC (Ping timeout: 260 seconds)
12:47 πŸ”— goekesmi has quit IRC (Ping timeout: 260 seconds)
12:47 πŸ”— goekesmi has joined #archiveteam-bs
12:55 πŸ”— Rickster has joined #archiveteam-bs
13:00 πŸ”— marvinw has joined #archiveteam-bs
13:33 πŸ”— Fletcher arkiver, the mediawiki bridge will allow group editing of metadata through the mediawiki interface that is then mirrored on IA
13:34 πŸ”— VADemon has quit IRC (Read error: Operation timed out)
13:36 πŸ”— Famicoman has joined #archiveteam-bs
13:47 πŸ”— arkiver3 has quit IRC (Ping timeout: 252 seconds)
13:51 πŸ”— arkiver3 has joined #archiveteam-bs
13:53 πŸ”— arkiver3 Fletcher: I see, so it's totally different from the WikiTeam WARC project
13:54 πŸ”— arkiver3 I still haven't started the grab of the actual mediawiki's yet
13:54 πŸ”— arkiver3 Needs a bit of more testing, but it should be almost ready to go
13:54 πŸ”— Fletcher yeah, it's just a way to get around the not fantastic IA user/item management
13:55 πŸ”— phuzion arkiver3: I've got a question about that videobot you were talking about. Is it supposed to be more archivebot or newsbuddy? Meaning is it intended for on-demand archival or regular recurring archival?
13:56 πŸ”— arkiver3 phuzion: currently I'm thinking more archivebot
13:56 πŸ”— arkiver3 But I'd like to add an option to automatically scrape channels periodically
13:57 πŸ”— arkiver3 Basically the project will have special scripts for the supported websites to ensure as good playback as possible and the extraction of as much metadata as possible.
13:58 πŸ”— HCross arkiver3, werent you thinking of adding Selenium or something to newsbuddy. Might that also work here?
14:01 πŸ”— phuzion arkiver3: I only ask because podcasts would be great to add to it.
14:01 πŸ”— phuzion Throw it an RSS and say "Check it every sunday" or whenever the podcast updates, and have the podcast automatically pushed to the proper collection and everything.
14:01 πŸ”— HCross I know this is a long shot, what about Apple podcasts
14:02 πŸ”— phuzion What about them?
14:02 πŸ”— HCross could we get them?
14:02 πŸ”— phuzion If they can be imported into a podcast program, then theoretically yeah
14:02 πŸ”— phuzion Podcasts are basically RSS files pointing to audio files.
14:03 πŸ”— phuzion http://feeds.twit.tv/twit.xml TWiT for example
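
A minimal sketch of the point above, that a podcast feed is an RSS file whose items point at audio files. It assumes a plain RSS 2.0 feed with `<enclosure>` elements; the sample feed, its URLs, and the function name `enclosure_urls` are made up for illustration and are not taken from any Archive Team tool.

```python
import xml.etree.ElementTree as ET

# Hypothetical feed used only for illustration; a real feed would be
# fetched from a URL such as the TWiT one mentioned in the log.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Podcast</title>
    <item>
      <title>Episode 1</title>
      <enclosure url="http://example.com/ep1.mp3" length="123" type="audio/mpeg"/>
    </item>
    <item>
      <title>Episode 2</title>
      <enclosure url="http://example.com/ep2.mp3" length="456" type="audio/mpeg"/>
    </item>
  </channel>
</rss>"""


def enclosure_urls(feed_xml):
    """Return (episode title, audio URL) pairs for every item with an enclosure."""
    root = ET.fromstring(feed_xml)
    results = []
    for item in root.iter("item"):
        enclosure = item.find("enclosure")
        # Items without an enclosure carry no audio file, so skip them.
        if enclosure is not None and enclosure.get("url"):
            results.append((item.findtext("title", default=""), enclosure.get("url")))
    return results


if __name__ == "__main__":
    for title, url in enclosure_urls(SAMPLE_FEED):
        print(title, url)
```

A recurring job ("check it every Sunday") would rerun this against the live feed and download only the URLs it has not seen before.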
14:04 πŸ”— HCross ah, its possible but requires someone to use iTunes to grab it http://superuser.com/questions/78415/get-rss-feed-from-itunes-podcast-links/782413
14:05 πŸ”— phuzion Oh wow
14:05 πŸ”— HCross Its Apple. They love doing stuff like this
14:05 πŸ”— phuzion Didn't know Apple had a proprietary format for podcasts
14:05 πŸ”— phuzion which is stupid
14:22 πŸ”— arkiver3 has quit IRC (Ping timeout: 252 seconds)
14:49 πŸ”— Boltsie__ has joined #archiveteam-bs
14:50 πŸ”— Boltsie__ is now known as Boltsie
14:55 πŸ”— VADemon has joined #archiveteam-bs
14:57 πŸ”— arkiver3 has joined #archiveteam-bs
15:17 πŸ”— arkiver3 has quit IRC (Ping timeout: 252 seconds)
15:29 πŸ”— RichardG has quit IRC (Read error: Operation timed out)
15:48 πŸ”— GLaDOS has quit IRC (Read error: Operation timed out)
15:49 πŸ”— ndiddy has joined #archiveteam-bs
15:56 πŸ”— RichardG has joined #archiveteam-bs
16:14 πŸ”— GLaDOS has joined #archiveteam-bs
16:23 πŸ”— VADemon has quit IRC (Quit: left4dead)
16:26 πŸ”— SketchCow arkiver: This is not a wikimedia to archive.org bridge like "save a wiki". This is being able to group-edit archive.org descriptions and metadata.
16:26 πŸ”— SketchCow I see this was answered. Sorry, just got up.
17:16 πŸ”— arkiver Thanks
17:22 πŸ”— arkiver HCross: phuzion: podcasts is also something we can add to videobot
17:22 πŸ”— phuzion ok cool
17:22 πŸ”— arkiver And also add the option to upload the files as audio item to IA
17:23 πŸ”— arkiver Other things might be support for some photo sites
17:25 πŸ”— vitzli gif/webm?
17:26 πŸ”— arkiver sure
17:26 πŸ”— arkiver Basically videobot will be a bot with specially written support for websites to do a grab as good as possible
17:26 πŸ”— arkiver SketchCow, more info on videobot ^
17:27 πŸ”— arkiver Where archivebot is a more general archiving bot, videobot would not support all websites, but the websites it does support will be grabbed better using videobot than archivebot
17:29 πŸ”— phuzion arkiver: Would it be possible to have archivebot intelligently forward requests to videobot when it knows that videobot can handle it better?
17:30 πŸ”— arkiver yeah, that can be added
17:30 πŸ”— phuzion For example, if someone does !ao on a youtube channel, have archivebot be like "Hey, that would be a great job for videobot. Forwarding the request for your convenience"
17:31 πŸ”— arkiver yes, but that would also need some change on the side of archivebot
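
The forwarding idea discussed above could look roughly like the following sketch. Everything here is hypothetical: videobot did not exist yet at the time of this log, and the host list and the function name `pick_bot` are assumptions for illustration, not part of ArchiveBot.

```python
from urllib.parse import urlparse

# Hypothetical set of sites with dedicated videobot support.
VIDEOBOT_HOSTS = {"youtube.com", "soundcloud.com"}


def pick_bot(url):
    """Choose which bot should handle a URL, based only on its hostname."""
    host = (urlparse(url).hostname or "").lower()
    # Treat "www.example.com" and "example.com" as the same site.
    if host.startswith("www."):
        host = host[4:]
    return "videobot" if host in VIDEOBOT_HOSTS else "archivebot"


if __name__ == "__main__":
    print(pick_bot("https://www.youtube.com/user/example"))
    print(pick_bot("http://example.com/"))
```

As arkiver notes, the real change would have to live on the archivebot side, since that is where the `!ao` command is received.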
17:31 πŸ”— espes__ has quit IRC (Read error: Operation timed out)
17:37 πŸ”— Fletcher Given the sometimes sporadic support for youtube-dl in archivebot it may be a good idea.
18:02 πŸ”— vitzli has quit IRC (Leaving)
18:06 πŸ”— Swizzle has joined #archiveteam-bs
18:07 πŸ”— i0npulse has quit IRC (hub.dk irc.homelien.no)
18:07 πŸ”— altlabel has quit IRC (hub.dk irc.homelien.no)
18:07 πŸ”— PotcFdk has quit IRC (hub.dk irc.homelien.no)
18:07 πŸ”— limebyte has quit IRC (hub.dk irc.homelien.no)
18:07 πŸ”— coretx has quit IRC (hub.dk irc.homelien.no)
18:07 πŸ”— pikhq has quit IRC (hub.dk irc.homelien.no)
18:07 πŸ”— PurpleSym has quit IRC (hub.dk irc.homelien.no)
18:38 πŸ”— i0npulse has joined #archiveteam-bs
18:38 πŸ”— PurpleSym has joined #archiveteam-bs
18:38 πŸ”— altlabel has joined #archiveteam-bs
18:38 πŸ”— PotcFdk has joined #archiveteam-bs
18:38 πŸ”— limebyte has joined #archiveteam-bs
18:38 πŸ”— coretx has joined #archiveteam-bs
18:38 πŸ”— pikhq has joined #archiveteam-bs
18:56 πŸ”— wyatt8740 has joined #archiveteam-bs
19:01 πŸ”— JW_work2 has quit IRC (Leaving.)
19:25 πŸ”— zino has joined #archiveteam-bs
19:31 πŸ”— zino I should probably move to a less analogue method of handling the data on my home servers...
19:31 πŸ”— Frogging arkiver: So, how does archiveteam work? Who downloads files, and where do they get stored, and who submits them to IA?
19:31 πŸ”— zino https://goo.gl/photos/2A56FZk448zWhGmL6
19:31 πŸ”— Frogging And how is it ensured that people aren't redundantly downloading the same things
19:32 πŸ”— schbirid it's all REALLY WELL ORGANIZED
19:33 πŸ”— Frogging Is it? :p
19:34 πŸ”— Fletcher Frogging, smaller sites/individual pages are handled by ArchiveBot (#archivebot) which outsources jobs to volunteer pipelines. Those pipelines will get a copy of the content in Warc format and upload it to a staging server where they're then loaded into the wayback machine for general consumption
19:35 πŸ”— snape_ It's the BEST ORGANIZED independent autonomous decentralized worldwide volunteer anarcho-syndicalist commune of archivist downloaders on the web today!
19:35 πŸ”— metalcamp has joined #archiveteam-bs
19:36 πŸ”— Fletcher Large projects are handled by a central tracker that hands out individual items to anyone running the ArchiveTeam Warrior (vm that runs wpull) and people running the scripts directly
19:36 πŸ”— RichardG has quit IRC (Read error: Operation timed out)
19:37 πŸ”— Fletcher From there content is uploaded to a staging server where it's prepared for uploading into IA, combining individual items into 50G warcs and performing any other tasks to make the content easily viewable
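
A toy sketch of how a central tracker can keep volunteers from downloading the same thing twice, as Fletcher describes: each item is handed to exactly one worker at a time. The `Tracker` class and the item names are invented for illustration; this is not the actual Archive Team tracker code.

```python
from collections import deque


class Tracker:
    """Hands each work item to one worker and records completed items."""

    def __init__(self, items):
        self.todo = deque(items)   # items nobody has claimed yet
        self.claimed = {}          # item -> worker currently holding it
        self.done = set()          # items reported as finished

    def claim(self, worker):
        """Give the next unclaimed item to a worker, or None if none remain."""
        if not self.todo:
            return None
        item = self.todo.popleft()
        self.claimed[item] = worker
        return item

    def finish(self, worker, item):
        """Mark an item as done; only the worker that claimed it may do so."""
        if self.claimed.get(item) != worker:
            raise ValueError("item was not claimed by this worker")
        del self.claimed[item]
        self.done.add(item)


if __name__ == "__main__":
    tracker = Tracker(["user:alice", "user:bob"])
    first = tracker.claim("warrior-1")
    second = tracker.claim("warrior-2")
    tracker.finish("warrior-1", first)
    print(first, second, tracker.done)
```

Because an item leaves the queue as soon as it is claimed, two workers never receive the same item. A production tracker would also need to requeue items whose worker disappears, which this sketch omits.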
19:37 πŸ”— joepie91 snape_: out of three!
19:37 πŸ”— joepie91 :P
19:37 πŸ”— zino There are 3? Do tell.
19:38 πŸ”— joepie91 bibanon does a similar thing, there's another one I forgot the name of
19:38 πŸ”— * zino Googles bibanon
19:38 πŸ”— schbirid http://www.stephenfry.com/2016/02/15/peedinthepool/
19:38 πŸ”— * arkiver googles bibanon
19:38 πŸ”— Frogging I see
19:39 πŸ”— arkiver bibanon is mostly 4chan archiving it seems
19:44 πŸ”— GLaDOS has quit IRC (Ping timeout: 260 seconds)
19:45 πŸ”— joepie91 they do a bunch of stuff, also run an archivebot instance
19:45 πŸ”— joepie91 as in
19:46 πŸ”— joepie91 tracker
19:46 πŸ”— joepie91 not quite archiveteam scale afaik
19:46 πŸ”— xmc ah nice
19:46 πŸ”— arkiver where is that located?
19:46 πŸ”— joepie91 but they exist nevertheless :P
19:46 πŸ”— arkiver ArchiveTeam still biggest :)
19:46 πŸ”— xmc always nice to hear that quality things are getting reused
19:46 πŸ”— joepie91 arkiver: best ask on their IRC
19:50 πŸ”— schbirid http://nugnugnug.com/pc/master/
19:52 πŸ”— Laverne has joined #archiveteam-bs
19:57 πŸ”— arkiver I see Fletcher just joined too
19:57 πŸ”— Fletcher :D
20:00 πŸ”— Fletcher "Archive Team is another rogue archiving organisation, one that's bigger than us. DO NOT bring up our organisation in their channels or in discussions with them unless you discuss it with Antonizoon or Dan_ on the IRC channel beforehand."
20:00 πŸ”— arkiver wut
20:00 πŸ”— arkiver where did you read that
20:01 πŸ”— Fletcher the rules in their topic
20:02 πŸ”— arkiver interesting
20:02 πŸ”— arkiver I see a dan here
20:04 πŸ”— ersi derpanon
20:08 πŸ”— joepie91 Fletcher: arkiver: there's a reason for that
20:08 πŸ”— joepie91 had some issues here in the past with some of their users
20:08 πŸ”— arkiver only dec3199?
20:08 πŸ”— joepie91 another one whose name I forgot
20:08 πŸ”— joepie91 that has been resolved
20:09 πŸ”— joepie91 but it's been decided to keep cross-pollination to a minimum to avoid future drama :P
20:09 πŸ”— joepie91 which is probably a wise decision
20:09 πŸ”— joepie91 so yeah, that topic is just a drama-prevention measure
20:09 πŸ”— snape_ Do not poke the autistic weaboos, check.
20:10 πŸ”— * ersi giggles
20:10 πŸ”— arkiver I only know about dec3199
20:11 πŸ”— arkiver as for the rest, I might have missed something
20:12 πŸ”— schbirid "organisation" http://images.memes.com/character/meme/dr-evil
20:13 πŸ”— snape_ "organization" sounds better than "gang", you have to admit...
20:14 πŸ”— godane i'm downloading all of the glennbeck facebook videos
20:14 πŸ”— Frogging Archive Gang
20:18 πŸ”— snape_ We could pretend to be a 3l33t scene group. 4RCHiV3T34M
20:19 πŸ”— midas arkiver: dec was...wow
20:19 πŸ”— zino AI is the new exclusive topsite.
20:30 πŸ”— metalcamp has quit IRC (Ping timeout: 492 seconds)
20:30 πŸ”— espes__ has joined #archiveteam-bs
20:37 πŸ”— ersi midas: mindblowingly silly, to be mild
20:41 πŸ”— godane i'm uploading 2008-08 of kpfa mp3s
21:25 πŸ”— jut has quit IRC (Read error: Connection reset by peer)
21:29 πŸ”— wyatt8740 has quit IRC (Read error: Operation timed out)
21:34 πŸ”— RichardG has joined #archiveteam-bs
21:36 πŸ”— schbirid has quit IRC (Quit: Leaving)
21:46 πŸ”— RichardG has quit IRC (Ping timeout: 633 seconds)
22:06 πŸ”— dan- yeah pretty much drama prevention stuff
22:07 πŸ”— dan- as you guys probably know, users can be pretty annoyingly insistent regarding 4chan archiving ;)
22:14 πŸ”— dan- usually best if they yell at us first, before running to you guys
22:16 πŸ”— RichardG has joined #archiveteam-bs
22:23 πŸ”— RichardG has quit IRC (Ping timeout: 360 seconds)
22:32 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 252 seconds)
22:33 πŸ”— joepie91 dan-: ohai :P
22:33 πŸ”— dan- joepie91: heyo!
22:35 πŸ”— superkuh has quit IRC (Ping timeout: 252 seconds)
22:37 πŸ”— Lord_Nigh has joined #archiveteam-bs
22:39 πŸ”— superkuh has joined #archiveteam-bs
23:28 πŸ”— mismatch has quit IRC (Remote host closed the connection)
23:28 πŸ”— mismatch has joined #archiveteam-bs
23:53 πŸ”— godane btw I'm getting a copy of Movie Magic tv series from 1994
