#archiveteam-bs 2019-03-12,Tue


Time Nickname Message
00:00 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
00:00 🔗 godane SketchCow: it's now down, it looks like: https://computer-literacy-project.pilots.bbcconnectedstudio.co.uk/
00:00 🔗 godane also this : https://old.reddit.com/r/DataHoarder/comments/azy6k6/bbc_computer_literacy_project_videos_down_have/
00:00 🔗 godane so good thing i grabbed all of it when i did
00:01 🔗 BlueMax has joined #archiveteam-bs
00:30 🔗 godane dashcloud: so i got a buffy episode from one of your tapes and turns out that i uploaded it : https://archive.org/details/Buffy_WB_WOC_2001-05-08
00:58 🔗 godane has quit IRC (Read error: Connection reset by peer)
01:12 🔗 godane has joined #archiveteam-bs
01:15 🔗 dashcloud I really thought you had a hardware capture card- I think I have some spares, so let me check storage, and if they work, I'll send you one
01:15 🔗 godane i don't want to put any into a computer
01:16 🔗 godane i prefer usb based ones so i don't screw up my computer
01:16 🔗 dashcloud the big benefit of the card is that you can totally avoid ffmpeg- you can just cat /dev/video0 > vid.mpg
01:16 🔗 godane there is a usb based box that does that too
01:17 🔗 godane https://www.mythtv.org/wiki/Hauppauge_HD-PVR
01:17 🔗 godane that's a bit pricey to me, seeing what i have been spending
01:17 🔗 godane it's somewhere in the $70 to $150 range
01:18 🔗 dashcloud well, if you really have no interest in an internal card, I won't search for one then
01:57 🔗 rustypand has left http://quassel-irc.org - Chat comfortably. Anywhere.
02:15 🔗 omarroth has quit IRC (Remote host closed the connection)
02:15 🔗 BlueMax has quit IRC (Quit: Leaving)
02:19 🔗 omarroth has joined #archiveteam-bs
02:45 🔗 ndiddy has left
03:31 🔗 kiska1 has quit IRC (Ping timeout (120 seconds))
03:39 🔗 turnkit_ has quit IRC (Read error: Operation timed out)
03:41 🔗 BlueMax has joined #archiveteam-bs
03:48 🔗 kiska1 has joined #archiveteam-bs
04:02 🔗 odemgi has joined #archiveteam-bs
04:05 🔗 odemgi_ has quit IRC (Ping timeout: 252 seconds)
04:11 🔗 odemg has quit IRC (Ping timeout: 615 seconds)
04:17 🔗 omarroth has quit IRC (Remote host closed the connection)
04:17 🔗 odemg has joined #archiveteam-bs
04:18 🔗 Mateon1 has quit IRC (Read error: Operation timed out)
04:41 🔗 qw3rty115 has joined #archiveteam-bs
04:43 🔗 qw3rty114 has quit IRC (Ping timeout: 600 seconds)
05:01 🔗 odemgi_ has joined #archiveteam-bs
05:03 🔗 odemgi has quit IRC (Ping timeout: 252 seconds)
05:10 🔗 odemg has quit IRC (Ping timeout: 615 seconds)
05:16 🔗 odemg has joined #archiveteam-bs
06:01 🔗 synm0nger has quit IRC (Read error: Operation timed out)
06:02 🔗 mr_archiv has quit IRC (Read error: Operation timed out)
06:02 🔗 Jopik has quit IRC (Write error: Broken pipe)
06:05 🔗 BnAboyZ has quit IRC (Read error: Operation timed out)
06:05 🔗 mr_archiv has joined #archiveteam-bs
06:06 🔗 colona has quit IRC (Read error: Operation timed out)
06:10 🔗 colona has joined #archiveteam-bs
06:11 🔗 BnAboyZ has joined #archiveteam-bs
06:11 🔗 decay has quit IRC (Remote host closed the connection)
06:19 🔗 decay has joined #archiveteam-bs
06:34 🔗 Atom__ has quit IRC (Ping timeout: 252 seconds)
06:53 🔗 Mateon1 has joined #archiveteam-bs
07:29 🔗 SynMonger has joined #archiveteam-bs
07:33 🔗 abstract has quit IRC (Read error: Operation timed out)
07:34 🔗 Exairnous JAA: I noticed that you did the social media I asked for with !ao as opposed to !a and I'm wondering why? Do social media sites have lots of hidden links that loop or something?
08:13 🔗 godane dashcloud: you can look for one of your capture cards
08:14 🔗 godane another tape is out of sync for some reason
08:14 🔗 godane but really i may need a new computer also cause my usb ports are acting weird
08:17 🔗 killsushi has quit IRC (Quit: Leaving)
08:17 🔗 killsushi has joined #archiveteam-bs
08:44 🔗 godane latest post: https://www.patreon.com/posts/having-problem-25315413
08:51 🔗 wp494 has quit IRC (Ping timeout: 506 seconds)
08:51 🔗 wp494 has joined #archiveteam-bs
08:56 🔗 xoxo has quit IRC (Ping timeout: 265 seconds)
08:57 🔗 xoxo has joined #archiveteam-bs
09:10 🔗 atomicthu has quit IRC (Read error: Operation timed out)
09:10 🔗 atomicthu has joined #archiveteam-bs
09:30 🔗 JAA Huh, we're looking for volunteer writers? I wasn't aware of that.
09:30 🔗 SmileyG writing what?
09:31 🔗 JAA I don't know. See what rustypand wrote yesterday at 23:28 UTC.
09:33 🔗 JAA Exairnous: An !a job on a social media page almost never works (Mastodon being an exception). The reason is that those sites rely heavily on JavaScript/XMLHttpRequests to load more content, and the bot can't do that. So I retrieve the individual posts' URLs with my own tool (snscrape) and then archive each of those instead. It's not a perfect solution, in particular because the profile page won't work
09:33 🔗 JAA correctly (no scrolling), but at least the content's preserved.
09:35 🔗 SketchCow Oh fusl - start communicating with me or others before uploading piles of archiveteam items into the open collections
09:38 🔗 SketchCow I mostly noticed because my thing that tells me how many things uploaded into the open collections noticed the spike.
09:39 🔗 Fusl_ k i'll stop the uploading, feel free to delete my stuff in the open collections, do note though that i deleted them from my side
09:39 🔗 SketchCow WAIT NO
09:39 🔗 SketchCow NO
09:39 🔗 SketchCow N O
09:40 🔗 SketchCow the_office_no_gif.gif
09:40 🔗 VoynichCr ALARM ALARM ALARM
09:40 🔗 Fusl_ what
09:40 🔗 SketchCow I MEAN, let me know because we have the facility to just have you upload directly into the archive team stacks
09:40 🔗 SketchCow Like, skip the line, go right into the collections
09:41 🔗 SketchCow I just shoved all your shit into https://archive.org/details/archiveteam_googleplus
09:41 🔗 SketchCow I can do things, I have powers
09:42 🔗 SketchCow But you might as well be uploading directly
09:42 🔗 SketchCow Same with these minecraft forums, which I'm going after next.
09:42 🔗 Fusl_ i dont have access to that?
09:42 🔗 SketchCow No, you don't
09:42 🔗 SketchCow But I can give you access
09:42 🔗 SketchCow I can do that
09:42 🔗 SketchCow This is like Archive Team Top Uploader 101
09:43 🔗 SketchCow How have they all not told you this in whatever seedy pub at the docks you all meet in
09:43 🔗 Fusl multiples in #googleminus asked for access, including HCross and kiska, we all dont have access yet so ¯\_(ツ)_/¯
09:43 🔗 SketchCow Hcross certainly has access
09:43 🔗 SketchCow I believe kiska does too
09:43 🔗 Fusl they do?
09:44 🔗 Fusl well then i'm the only one without it
09:44 🔗 SketchCow Access is not hard
09:44 🔗 SketchCow You probably are!
09:44 🔗 Fusl if you could grant me access that would be great
09:44 🔗 HCross I have access
09:45 🔗 Fusl 23:56 <@HCross> I dont have collection, its going straight into opensource atm
09:45 🔗 Fusl ¯\_(ツ)_/¯
09:47 🔗 kiska We got access about 24 hrs ago, we assumed you got it as well
09:47 🔗 Fusl nope
09:47 🔗 HCross I asked for email addresses
09:47 🔗 kiska I think I told you in voice...
09:48 🔗 SketchCow You are all adorable bon-bons
09:48 🔗 SketchCow Anyway, I just made a minecraft forum collection, it's throwing in the 416 items
09:48 🔗 Fusl i'm gonna let the uploaders finish and exit after the current items are uploaded
09:48 🔗 SketchCow http://fos.textfiles.com/RECOGNIZER/ Here's how I see all the stuff
09:49 🔗 SketchCow That 1203 in the texts list upper left. I see a spike, I go see who's being the hero
09:50 🔗 SketchCow Don't be thin-skinned, Fusl
09:50 🔗 SketchCow Not a quality that works in this bag of marbles
09:50 🔗 SketchCow Here, go read all this crazy Word-processing shiznat that Marcin Wichary is uploading
09:50 🔗 SketchCow https://archive.org/details/@marcin_wichary
09:51 🔗 SketchCow I find it calming
09:51 🔗 Fusl anyways if you can get me access to the google- collection, perfect, if not, also fine, i'll just push to someone else and call it a day idc ¯\_(ツ)_/¯
09:52 🔗 Fusl as well as the mcf
09:55 🔗 SketchCow You now have access to archiveteam_googleplus
09:55 🔗 Fusl cool
09:55 🔗 Fusl kiska or HCross: please assist in modifying the config?
09:55 🔗 SketchCow Getting you the other one. (Things R slow)
09:56 🔗 Fusl IA_COLLECTION="opensource" -> IA_COLLECTION="archiveteam_googleplus" ?
09:56 🔗 kiska Yep
09:56 🔗 kiska Save then run factory
09:57 🔗 SketchCow Boop, you also have access to archiveteam_minecraftforums
09:57 🔗 Fusl yeet, you are the hero
09:58 🔗 Fusl now to figure out how to upload into a specific collection with the ia cli
09:59 🔗 SketchCow ia upload blah blah blah -m "collection:<collection>"
10:01 🔗 Fusl this correct? --metadata=mediatype:web --metadata=collection:archiveteam_minecraftforums
10:01 🔗 SketchCow I use -m " " and find it works better
10:01 🔗 SketchCow -m "mediatype:web" -m "collection:archiveteam_minecraftforums"
10:02 🔗 Fusl k, thanks
10:02 🔗 JAA --metadata=mediatype:web works perfectly fine as well.
10:02 🔗 JAA Well, maybe not if you're on Windows or something, but who does that anyway? :-)
10:04 🔗 SketchCow In only 5 more hours, my machine will be done uploading the FIRST 1.4tb batch of Minecraft Artifacts.
10:08 🔗 SketchCow I can see that the 645 remaining gigabytes of what's uploaded so far is going in, THAT's taking an hour every 50gb
10:08 🔗 Fusl SketchCow: is uploading to the open collection generally fine for completely random grab-site warcs or do you want me to push those into a separate collection?
10:09 🔗 Jens has quit IRC (Remote host closed the connection)
10:10 🔗 Jens has joined #archiveteam-bs
10:14 🔗 SketchCow If someone uploads WARC archives to the general upload collection, it will get pushed into the outsider WARCs collection and never go into the wayback
10:14 🔗 SketchCow The archivebot is the way for random grab-site warcs
10:16 🔗 Fusl SketchCow: my grab-site jobs are ones that archivebot fails on
10:16 🔗 Fusl for example, when pipeline ips get banned from websites due to request rate
10:16 🔗 SketchCow Give it a name like archivebot_alt
10:16 🔗 SketchCow archivebot_alt_*
10:18 🔗 Fusl just the item name prefixed with archivebot_alt_?
10:18 🔗 Fusl and no special collection?
10:18 🔗 SketchCow It should go into archivebot's collection.
10:19 🔗 JAA Uh, do we want that? That collection's already annoying enough to handle.
10:19 🔗 SketchCow I don't know, tell me why it's a bad idea
10:19 🔗 SketchCow What makes it annoying to handle
10:20 🔗 JAA Well, partially it's just bugs in the ArchiveBot viewer, but inconsistent filenames etc.
10:20 🔗 JAA But at least all files currently belong to some ArchiveBot job.
10:21 🔗 JAA If we start throwing other files in there, it gets even messier.
10:21 🔗 SketchCow Messier in what way? it all just gets yanked in by IA anyway
10:21 🔗 JAA Yeah, I mean more like "I want to find the WARCs that belong to ArchiveBot job X".
10:22 🔗 JAA For example, when the site's excluded from the WBM or blocked by robots.txt, but you want to check what it grabbed.
10:22 🔗 SketchCow So we could make an archivebot_alt collection
10:24 🔗 JAA Yeah, that'd work I guess. Although at that point we might as well give it a different name since it doesn't have anything to do with ArchiveBot other than that there's some overlap of which sites are grabbed.
10:24 🔗 JAA "archivebot_alt" to me suggests that it's another instance of ArchiveBot or something.
10:29 🔗 Fusl "notarchivebot"
10:31 🔗 Fusl SketchCow: speaking of "never go into the wayback", do the minecraftforum warcs go into the WBM?
10:31 🔗 Fusl since that's technically what i grabbed them for
10:38 🔗 SketchCow Well, now they will.
10:41 🔗 Jopik has joined #archiveteam-bs
10:48 🔗 kiska Can you name it archiveteam_grabsite ?
10:53 🔗 SketchCow I could.
10:57 🔗 argus has quit IRC (Remote host closed the connection)
10:57 🔗 argus has joined #archiveteam-bs
11:51 🔗 mr_archiv has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 Terbium has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 Hani has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 evul has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 purplebot has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 LFlare has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 Coderjo_ has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 Fusl has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 casc0de has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 Soni has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 svchfoo3 has quit IRC (west.us.hub irc.mzima.net)
11:51 🔗 tjg1_ has quit IRC (west.us.hub irc.mzima.net)
12:00 🔗 mr_archiv has joined #archiveteam-bs
12:00 🔗 Terbium has joined #archiveteam-bs
12:00 🔗 Hani has joined #archiveteam-bs
12:00 🔗 evul has joined #archiveteam-bs
12:00 🔗 purplebot has joined #archiveteam-bs
12:00 🔗 LFlare has joined #archiveteam-bs
12:00 🔗 Coderjo_ has joined #archiveteam-bs
12:00 🔗 Fusl has joined #archiveteam-bs
12:00 🔗 casc0de has joined #archiveteam-bs
12:00 🔗 Soni has joined #archiveteam-bs
12:00 🔗 svchfoo3 has joined #archiveteam-bs
12:00 🔗 tjg1_ has joined #archiveteam-bs
12:00 🔗 irc.mzima.net sets mode: +o svchfoo3
12:12 🔗 odemgi has joined #archiveteam-bs
12:15 🔗 odemgi_ has quit IRC (Ping timeout: 252 seconds)
12:18 🔗 BlueMax has quit IRC (Quit: Leaving)
12:21 🔗 odemg has quit IRC (Ping timeout: 615 seconds)
12:26 🔗 deevious has quit IRC (Remote host closed the connection)
12:28 🔗 odemg has joined #archiveteam-bs
13:08 🔗 deevious has joined #archiveteam-bs
13:13 🔗 Atom__ has joined #archiveteam-bs
13:17 🔗 killsushi has quit IRC (Quit: Leaving)
13:18 🔗 abstract has joined #archiveteam-bs
13:48 🔗 ndiddy has joined #archiveteam-bs
13:56 🔗 arkiver Fusl: you used grab-site for the minecraftforums?
14:48 🔗 argus has quit IRC (Remote host closed the connection)
14:48 🔗 argus has joined #archiveteam-bs
15:06 🔗 fredgido has joined #archiveteam-bs
15:07 🔗 fredgido_ has quit IRC (Ping timeout: 252 seconds)
15:24 🔗 slyphic has quit IRC (Read error: Connection reset by peer)
15:24 🔗 slyphic has joined #archiveteam-bs
15:38 🔗 BlueMax has joined #archiveteam-bs
15:38 🔗 DFJustin https://twitter.com/DylanLJMartin/status/1101873832003018757
15:49 🔗 rustypand has joined #archiveteam-bs
15:51 🔗 Fusl_ arkiver: yes, it required lots of ignores that i had to manually compile
15:55 🔗 rustypand has left http://quassel-irc.org - Chat comfortably. Anywhere.
16:57 🔗 slyphic has quit IRC (Quit: leaving)
16:57 🔗 slyphic has joined #archiveteam-bs
17:36 🔗 marked2go has joined #archiveteam-bs
17:52 🔗 wp494 has quit IRC (Read error: Operation timed out)
17:52 🔗 wp494 has joined #archiveteam-bs
18:02 🔗 abstract has quit IRC (Read error: Operation timed out)
18:23 🔗 Pixi` has quit IRC (Quit: Pixi`)
18:24 🔗 wp494 has quit IRC (Ping timeout: 255 seconds)
18:25 🔗 wabu has quit IRC (Read error: Operation timed out)
18:25 🔗 wabu has joined #archiveteam-bs
18:27 🔗 Pixi has joined #archiveteam-bs
18:40 🔗 Stiletto has quit IRC ()
18:59 🔗 icedice has joined #archiveteam-bs
19:09 🔗 wp494 has joined #archiveteam-bs
19:25 🔗 wp494 has quit IRC (Ping timeout: 268 seconds)
19:43 🔗 Exairnous JAA: Interesting! And thanks for putting up with my noob questions.
19:45 🔗 kiska1 has quit IRC (Read error: Operation timed out)
19:47 🔗 kiska1 has joined #archiveteam-bs
19:59 🔗 Stiletto has joined #archiveteam-bs
20:08 🔗 BlueMax has quit IRC (Quit: Leaving)
20:10 🔗 BlueMax has joined #archiveteam-bs
20:37 🔗 Exairnous has quit IRC (Ping timeout: 615 seconds)
20:55 🔗 JAA Did we make any progress regarding .eu domains owned by UK residents?
21:36 🔗 wp494 has joined #archiveteam-bs
21:49 🔗 abstract has joined #archiveteam-bs
21:52 🔗 Exairnous has joined #archiveteam-bs
22:00 🔗 killsushi has joined #archiveteam-bs
22:35 🔗 abstract has quit IRC (Read error: Operation timed out)
23:13 🔗 ndiddy has quit IRC ()
