[00:00] *** BlueMax has quit IRC (Read error: Connection reset by peer)
[00:00] SketchCow: it's now down, it looks like: https://computer-literacy-project.pilots.bbcconnectedstudio.co.uk/
[00:00] also this: https://old.reddit.com/r/DataHoarder/comments/azy6k6/bbc_computer_literacy_project_videos_down_have/
[00:00] so good thing i grabbed all of it when i did
[00:01] *** BlueMax has joined #archiveteam-bs
[00:30] dashcloud: so i got a buffy episode from one of your tapes and turns out that i uploaded it: https://archive.org/details/Buffy_WB_WOC_2001-05-08
[00:58] *** godane has quit IRC (Read error: Connection reset by peer)
[01:12] *** godane has joined #archiveteam-bs
[01:15] I really thought you had a hardware capture card- I think I have some spares, so let me check storage, and if they work, I'll send you one
[01:15] i don't want to put any into a computer
[01:16] i prefer usb based ones so i don't screw up my computer
[01:16] the big benefit of the card is that you can totally avoid ffmpeg- you can just cat /dev/video0 > vid.mpg
[01:16] there is a usb based box that does that too
[01:17] https://www.mythtv.org/wiki/Hauppauge_HD-PVR
[01:17] that's a bit pricey to me, seeing what i have been spending
[01:17] it's somewhere in the $70 to $150 range
[01:18] well, if you really have no interest in an internal card, I won't search for one then
[01:57] *** rustypand has left http://quassel-irc.org - Chat comfortably. Anywhere.
[02:15] *** omarroth has quit IRC (Remote host closed the connection)
[02:15] *** BlueMax has quit IRC (Quit: Leaving)
[02:19] *** omarroth has joined #archiveteam-bs
[02:45] *** ndiddy has left
[03:31] *** kiska1 has quit IRC (Ping timeout (120 seconds))
[03:39] *** turnkit_ has quit IRC (Read error: Operation timed out)
[03:41] *** BlueMax has joined #archiveteam-bs
[03:48] *** kiska1 has joined #archiveteam-bs
[04:02] *** odemgi has joined #archiveteam-bs
[04:05] *** odemgi_ has quit IRC (Ping timeout: 252 seconds)
[04:11] *** odemg has quit IRC (Ping timeout: 615 seconds)
[04:17] *** omarroth has quit IRC (Remote host closed the connection)
[04:17] *** odemg has joined #archiveteam-bs
[04:18] *** Mateon1 has quit IRC (Read error: Operation timed out)
[04:41] *** qw3rty115 has joined #archiveteam-bs
[04:43] *** qw3rty114 has quit IRC (Ping timeout: 600 seconds)
[05:01] *** odemgi_ has joined #archiveteam-bs
[05:03] *** odemgi has quit IRC (Ping timeout: 252 seconds)
[05:10] *** odemg has quit IRC (Ping timeout: 615 seconds)
[05:16] *** odemg has joined #archiveteam-bs
[06:01] *** synm0nger has quit IRC (Read error: Operation timed out)
[06:02] *** mr_archiv has quit IRC (Read error: Operation timed out)
[06:02] *** Jopik has quit IRC (Write error: Broken pipe)
[06:05] *** BnAboyZ has quit IRC (Read error: Operation timed out)
[06:05] *** mr_archiv has joined #archiveteam-bs
[06:06] *** colona has quit IRC (Read error: Operation timed out)
[06:10] *** colona has joined #archiveteam-bs
[06:11] *** BnAboyZ has joined #archiveteam-bs
[06:11] *** decay has quit IRC (Remote host closed the connection)
[06:19] *** decay has joined #archiveteam-bs
[06:34] *** Atom__ has quit IRC (Ping timeout: 252 seconds)
[06:53] *** Mateon1 has joined #archiveteam-bs
[07:29] *** SynMonger has joined #archiveteam-bs
[07:33] *** abstract has quit IRC (Read error: Operation timed out)
[07:34] JAA: I noticed that you did the social media I asked for with !ao as opposed to !a and I've been wondering why? Do social media sites have lots of hidden links that loop or something?
[08:13] dashcloud: you can look for one of your capture cards
[08:14] another tape is out of sync for some reason
[08:14] but really i may need a new computer also cause my usb ports are acting weird
[08:17] *** killsushi has quit IRC (Quit: Leaving)
[08:17] *** killsushi has joined #archiveteam-bs
[08:44] latest post: https://www.patreon.com/posts/having-problem-25315413
[08:51] *** wp494 has quit IRC (Ping timeout: 506 seconds)
[08:51] *** wp494 has joined #archiveteam-bs
[08:56] *** xoxo has quit IRC (Ping timeout: 265 seconds)
[08:57] *** xoxo has joined #archiveteam-bs
[09:10] *** atomicthu has quit IRC (Read error: Operation timed out)
[09:10] *** atomicthu has joined #archiveteam-bs
[09:30] Huh, we're looking for volunteer writers? I wasn't aware of that.
[09:30] writing what?
[09:31] I don't know. See what rustypand wrote yesterday at 23:28 UTC.
[09:33] Exairnous: An !a job on a social media page almost never works (Mastodon being an exception). The reason is that those sites rely heavily on JavaScript/XMLHttpRequests to load more content, and the bot can't do that. So I retrieve the individual posts' URLs with my own tool (snscrape) and then archive each of those instead. It's not a perfect solution, in particular because the profile page won't work
[09:33] correctly (no scrolling), but at least the content's preserved.
[09:35] Oh fusl - start communicating with me or others before uploading piles of archiveteam items into the open collections
[09:38] I mostly noticed because my thing that tells me how many things uploaded into the open collections noticed the spike.
[09:39] k ill stop the uploading, feel free to delete my stuff in the open collections, do note though that i deleted them from my side
[09:39] WAIT NO
[09:39] NO
[09:39] N O
[09:40] the_office_no_gif.gif
[09:40] ALARM ALARM ALARM
[09:40] what
[09:40] I MEAN, let me know because we have the facility to just have you upload directly into the archive team stacks
[09:40] Like, skip the line, go right into the collections
[09:41] I just shoved all your shit into https://archive.org/details/archiveteam_googleplus
[09:41] I can do things, I have powers
[09:42] But you might as well be uploading directly
[09:42] Same with these minecraft forums, which I'm going after next.
[09:42] i dont have access to that?
[09:42] No, you don't
[09:42] But I can give you access
[09:42] I can do that
[09:42] This is like Archive Team Top Uploader 101
[09:43] How have they all not told you this in whatever seedy pub at the docks you all meet in
[09:43] multiple people in #googleminus asked for access, including HCross and kiska, we all dont have access yet so ¯\_(ツ)_/¯
[09:43] Hcross certainly has access
[09:43] I believe kiska does too
[09:43] they do?
[09:44] well then i'm the only one without it
[09:44] Access is not hard
[09:44] You probably are!
[09:44] if you could grant me access that would be great
[09:44] I have access
[09:45] 23:56 <@HCross> I dont have collection, its going straight into opensource atm
[09:45] ¯\_(ツ)_/¯
[09:47] We got access about 24 hrs ago, we assumed you got it as well
[09:47] nope
[09:47] I asked for email addresses
[09:47] I think I told you in voice...
[09:48] You are all adorable bon-bons
[09:48] Anyway, I just made a minecraft forum collection, it's throwing in the 416 items
[09:48] i'm gonna let the uploaders finish and exit after the current items are uploaded
[09:48] http://fos.textfiles.com/RECOGNIZER/ Here's how I see all the stuff
[09:49] That 1203 in the texts list upper left. I see a spike, I go see who's being the hero
[09:50] Don't be thin-skinned, Fusl
[09:50] Not a quality that works in this bag of marbles
[09:50] Here, go read all this crazy Word-processing shiznat that Marcin Wichary is uploading
[09:50] https://archive.org/details/@marcin_wichary
[09:51] I find it calming
[09:51] anyways if you can get me access to the google- collection, perfect, if not, also fine, i'll just push to someone else and call it a day idc ¯\_(ツ)_/¯
[09:52] as well as the mcf
[09:55] You now have access to archiveteam_googleplus
[09:55] cool
[09:55] kiska or HCross: please assist in modifying the config?
[09:55] Getting you the other one. (Things R slow)
[09:56] IA_COLLECTION="opensource" -> IA_COLLECTION="archiveteam_googleplus" ?
[09:56] Yep
[09:56] Save then run factory
[09:57] Boop, you also have access to archiveteam_minecraftforums
[09:57] yeet, you are the hero
[09:58] now to figure out how to upload into a specific collection with the ia cli
[09:59] ia upload blah blah blah -m "collection:"
[10:01] this correct? --metadata=mediatype:web --metadata=collection:archiveteam_minecraftforums
[10:01] I use -m " " and find it works better
[10:01] -m "mediatype:web" -m "collection:archiveteam_minecraftforums"
[10:02] k, thanks
[10:02] --metadata=mediatype:web works perfectly fine as well.
[10:02] Well, maybe not if you're on Windows or something, but who does that anyway? :-)
[10:04] In only 5 more hours, my machine will be done uploading the FIRST 1.4tb batch of Minecraft Artifacts.
[10:08] I can see that the 645 remaining gigabytes of what's uploaded so far is going in, THAT's taking an hour every 50gb
[10:08] SketchCow: is uploading to the open collection generally fine for completely random grab-site warcs or do you want me to push those into a separate collection?
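Editor's note on the `ia upload` exchange at 10:01-10:02 above: a minimal sketch of that invocation, assuming the `ia` command-line tool is installed and already configured with credentials. The helper name `build_ia_upload_command`, the item identifier and the file name are hypothetical and only illustrate the shape of the call; the sketch builds and prints the command rather than running it.

```python
import shlex


def build_ia_upload_command(identifier, files, collection, mediatype="web"):
    """Return the argv list for an `ia upload` call targeting one collection.

    `identifier` and `files` are placeholders chosen by the caller; nothing
    here contacts archive.org.
    """
    if not files:
        raise ValueError("at least one file is required")
    command = ["ia", "upload", identifier, *files]
    # Metadata is passed as key:value pairs, one -m flag per pair, as in the
    # log; the --metadata=key:value spelling is the long form of the same flag.
    command += ["-m", f"mediatype:{mediatype}"]
    command += ["-m", f"collection:{collection}"]
    return command


if __name__ == "__main__":
    cmd = build_ia_upload_command(
        "example_item_identifier",      # hypothetical item name
        ["example.warc.gz"],            # hypothetical file
        "archiveteam_minecraftforums",  # collection named in the log
    )
    # Print a shell-quoted command line for review instead of executing it.
    print(shlex.join(cmd))
```

Uploading into a collection such as archiveteam_minecraftforums only works once the account has been granted access to it, as happens at 09:57 above.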
[10:09] *** Jens has quit IRC (Remote host closed the connection)
[10:10] *** Jens has joined #archiveteam-bs
[10:14] If someone uploads WARC archives to the general upload collection, it will get pushed into the outsider WARCs collection and never go into the wayback
[10:14] The archivebot is the way for random grab-site warcs
[10:16] SketchCow: my grab-site jobs are ones that archivebot fails on
[10:16] for example, when pipeline ips get banned from websites due to request rate
[10:16] Give it a name like archivebot_alt
[10:16] archivebot_alt_*
[10:18] just the item name prefixed with archivebot_alt_?
[10:18] and no special collection?
[10:18] It should go into archivebot's collection.
[10:19] Uh, do we want that? That collection's already annoying enough to handle.
[10:19] I don't know, tell me why it's a bad idea
[10:19] What makes it annoying to handle
[10:20] Well, partially it's just bugs in the ArchiveBot viewer, but inconsistent filenames etc.
[10:20] But at least all files currently belong to some ArchiveBot job.
[10:21] If we start throwing other files in there, it gets even messier.
[10:21] Messier in what way? it all just gets yanked in by IA anyway
[10:21] Yeah, I mean more like "I want to find the WARCs that belong to ArchiveBot job X".
[10:22] For example, when the site's excluded from the WBM or blocked by robots.txt, but you want to check what it grabbed.
[10:22] So we could make an archivebot_alt collection
[10:24] Yeah, that'd work I guess. Although at that point we might as well give it a different name since it doesn't have anything to do with ArchiveBot other than that there's some overlap of which sites are grabbed.
[10:24] "archivebot_alt" to me suggests that it's another instance of ArchiveBot or something.
[10:29] "notarchivebot"
[10:31] SketchCow: speaking of "never go into the wayback", do the minecraftforum warcs go into the WBM?
[10:31] since that's technically what i grabbed them for
[10:38] Well, now they will.
[10:41] *** Jopik has joined #archiveteam-bs
[10:48] Can you name it archiveteam_grabsite ?
[10:53] I could.
[10:57] *** argus has quit IRC (Remote host closed the connection)
[10:57] *** argus has joined #archiveteam-bs
[11:51] *** mr_archiv has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** Terbium has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** Hani has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** evul has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** purplebot has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** LFlare has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** Coderjo_ has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** Fusl has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** casc0de has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** Soni has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** svchfoo3 has quit IRC (west.us.hub irc.mzima.net)
[11:51] *** tjg1_ has quit IRC (west.us.hub irc.mzima.net)
[12:00] *** mr_archiv has joined #archiveteam-bs
[12:00] *** Terbium has joined #archiveteam-bs
[12:00] *** Hani has joined #archiveteam-bs
[12:00] *** evul has joined #archiveteam-bs
[12:00] *** purplebot has joined #archiveteam-bs
[12:00] *** LFlare has joined #archiveteam-bs
[12:00] *** Coderjo_ has joined #archiveteam-bs
[12:00] *** Fusl has joined #archiveteam-bs
[12:00] *** casc0de has joined #archiveteam-bs
[12:00] *** Soni has joined #archiveteam-bs
[12:00] *** svchfoo3 has joined #archiveteam-bs
[12:00] *** tjg1_ has joined #archiveteam-bs
[12:00] *** irc.mzima.net sets mode: +o svchfoo3
[12:12] *** odemgi has joined #archiveteam-bs
[12:15] *** odemgi_ has quit IRC (Ping timeout: 252 seconds)
[12:18] *** BlueMax has quit IRC (Quit: Leaving)
[12:21] *** odemg has quit IRC (Ping timeout: 615 seconds)
[12:26] *** deevious has quit IRC (Remote host closed the connection)
[12:28] *** odemg has joined #archiveteam-bs
[13:08] *** deevious has joined #archiveteam-bs
[13:13] *** Atom__ has joined #archiveteam-bs
[13:17] *** killsushi has quit IRC (Quit: Leaving)
[13:18] *** abstract has joined #archiveteam-bs
[13:48] *** ndiddy has joined #archiveteam-bs
[13:56] Fusl: you used grab-site for the minecraftforums?
[14:48] *** argus has quit IRC (Remote host closed the connection)
[14:48] *** argus has joined #archiveteam-bs
[15:06] *** fredgido has joined #archiveteam-bs
[15:07] *** fredgido_ has quit IRC (Ping timeout: 252 seconds)
[15:24] *** slyphic has quit IRC (Read error: Connection reset by peer)
[15:24] *** slyphic has joined #archiveteam-bs
[15:38] *** BlueMax has joined #archiveteam-bs
[15:38] https://twitter.com/DylanLJMartin/status/1101873832003018757
[15:49] *** rustypand has joined #archiveteam-bs
[15:51] arkiver: yes, it required lots of ignores that i had to manually compile
[15:55] *** rustypand has left http://quassel-irc.org - Chat comfortably. Anywhere.
[16:57] *** slyphic has quit IRC (Quit: leaving)
[16:57] *** slyphic has joined #archiveteam-bs
[17:36] *** marked2go has joined #archiveteam-bs
[17:52] *** wp494 has quit IRC (Read error: Operation timed out)
[17:52] *** wp494 has joined #archiveteam-bs
[18:02] *** abstract has quit IRC (Read error: Operation timed out)
[18:23] *** Pixi` has quit IRC (Quit: Pixi`)
[18:24] *** wp494 has quit IRC (Ping timeout: 255 seconds)
[18:25] *** wabu has quit IRC (Read error: Operation timed out)
[18:25] *** wabu has joined #archiveteam-bs
[18:27] *** Pixi has joined #archiveteam-bs
[18:40] *** Stiletto has quit IRC ()
[18:59] *** icedice has joined #archiveteam-bs
[19:09] *** wp494 has joined #archiveteam-bs
[19:25] *** wp494 has quit IRC (Ping timeout: 268 seconds)
[19:43] JAA: Interesting! And thanks for putting up with my noob questions.
[19:45] *** kiska1 has quit IRC (Read error: Operation timed out)
[19:47] *** kiska1 has joined #archiveteam-bs
[19:59] *** Stiletto has joined #archiveteam-bs
[20:08] *** BlueMax has quit IRC (Quit: Leaving)
[20:10] *** BlueMax has joined #archiveteam-bs
[20:37] *** Exairnous has quit IRC (Ping timeout: 615 seconds)
[20:55] Did we make any progress regarding .eu domains owned by UK residents?
[21:36] *** wp494 has joined #archiveteam-bs
[21:49] *** abstract has joined #archiveteam-bs
[21:52] *** Exairnous has joined #archiveteam-bs
[22:00] *** killsushi has joined #archiveteam-bs
[22:35] *** abstract has quit IRC (Read error: Operation timed out)
[23:13] *** ndiddy has quit IRC ()