[00:00] our code should pick those out and retry (or fail)
[00:00] and with fail I mean, fail and give the item to someone else eventually so no data is lost
[00:00] and no bad data is archived
[00:00] glad to hear :)
[00:01] BTW, do you have any container for FreeBSD (i.e. bastille)
[00:01] as there is no docker under FreeBSD
[00:01] so for now most projects are manual with scripts, if you are familiar with that
[00:01] not sure, maybe Fusl would know more about that
[00:01] but Fusl is on hackint
[00:01] same channels on hackint
[00:02] thank you, I can ask him about it
[00:02] her*
[00:02] sorry :) :)
[00:03] about scripts: where could I read about them?
[00:04] are they just shell scripts? how much manual interaction do they require?
[00:04] our wiki, github.com/archiveteam, tracker.archiveteam.org
[00:04] https://www.archiveteam.org/index.php?title=Projects#Manual_projects_2 ?
[00:05] these?
[00:05] yep, but also see the READMEs on github
[00:05] and you might want to join hackint
[00:06] should I reconnect there now?
[00:08] also, why do you recommend that I participate in script projects rather than warrior-based ones?
[00:08] are they more important for saving sites?
[00:08] or something else?
[00:13] A lot of the new projects and discussions are on hackint, however the general chats (#archiveteam #archiveteam-bs #archiveteam-ot) are still on EFnet. I recommend using a client that allows you to connect to both to keep up
[00:14] thanks, I will try
[00:14] If you can run an OVA VM image then you can run the Warrior Appliance (VM + Docker + Warrior Container + Watchtower autoupdate).
[00:15] A lot of the new projects are in rapid development so they aren't pushed to Warriors immediately but as manual scripts
[00:15] However, many of the scripts are automatically built into docker containers on updates
[00:16] are new projects of more value than old ones?
[00:16] could I just run the latter?
[00:17] (maybe I just need some time to get accustomed to the new ones, if you advise me to do so)
[00:17] I for sure will read much more to be able to help
[00:17] http://tracker.archiveteam.org/ shows some of the active(ish) projects
[00:18] My first and main question here was about the whole ability to run the project from a Russian network
[00:18] you can try URLteam for now if you want to get started on a warrior project (it's the default)
[00:18] OK
[00:19] I feel that I should help where there is more need for help, not where it would be easier for me to run :)
[00:20] just need some time to read a lot about all this :)
[00:20] You can try the reddit test project then as that's what we are working on
[00:21] It shows up on warrior (you can manually select it from the warrior control panel)
[00:21] Although I'm not sure if it auto updates the scripts (I'll check now)
[00:22] btw, could it be run under FreeBSD's BHYVE?
[00:23] I unfortunately do not have experience with the FreeBSD OS or BHYVE virtualization
[00:23] (either way FreeBSD does support VirtualBox, so I'm rather OK with it)
[00:23] OK
[00:23] if you can run docker you can do the following: sudo docker run --rm -d warcforceone/reddit-grab:latest --concurrent 20 ${NICKNAME}
[00:23] change ${NICKNAME} to what you want to show up as
[00:23] sorry, docker isn't available under FreeBSD…
[00:24] but I could use it via warrior under VirtualBox?
[00:25] Yeah, you can probably run a linux OS as a VM and then either run reddit-grab as a docker or as a script from here: https://github.com/ArchiveTeam/reddit-grab
[00:25] The project readme states that FreeBSD support is currently unknown
[00:25] yes, it's a rare OS, especially now
[00:26] what should I expect about bandwidth? and disk space?
[00:27] *** HP_Archiv has quit IRC (Quit: Leaving)
[00:28] lemme check
[00:29] (I have 100 Mbit/s FTTB and 640 GB HDD with ZFS)
[00:30] I'm averaging about 6MB/s with around <10GB disk usage
[00:30] OK :)
[00:30] this will probably vary per machine and internet connection
[00:30] also because fusl is utterly annihilating the tracker and Reddit
[00:33] korobkov: i forgot to mention, the project chat is at #shreddit over at hackint
[00:34] thank you, so as you advised me to start with it, I will follow your advice :)
[00:35] :)
[00:35] just need some sleep now (it's 04:35 UTC+04 here) and I still haven't slept :)
[00:35] good night!
[00:36] thank you! see you later :)
[00:36] * korobkov is going to sleep/offline mode…
[00:36] *** korobkov has quit IRC (Quit: ERC (IRC client for Emacs 26.3))
[00:46] *** bsmith093 has quit IRC (Ping timeout: 265 seconds)
[01:01] *** bsmith093 has joined #archiveteam-bs
[01:02] *** Ravenloft has quit IRC (Remote host closed the connection)
[02:05] *** britmob has quit IRC (Ping timeout: 265 seconds)
[02:12] *** HP_Archiv has joined #archiveteam-bs
[02:13] *** britmob has joined #archiveteam-bs
[02:43] *** HP_Archiv has quit IRC (Quit: Leaving)
[03:23] *** HP_Archiv has joined #archiveteam-bs
[03:38] *** HP_Archiv has quit IRC (Quit: Leaving)
[03:45] *** qw3rty__ has joined #archiveteam-bs
[03:53] *** qw3rty_ has quit IRC (Read error: Operation timed out)
[05:10] *** Zandro has joined #archiveteam-bs
[05:10] *** Zandro has left Leaving
[07:06] *** K4k__ has quit IRC (Ping timeout: 260 seconds)
[07:06] *** K4k__ has joined #archiveteam-bs
[07:07] *** godane has quit IRC (hub.efnet.us irc.Prison.NET)
[07:07] *** phirephly has quit IRC (hub.efnet.us irc.Prison.NET)
[07:07] *** Pixi` has quit IRC (hub.efnet.us irc.Prison.NET)
[07:07] *** superkuh has quit IRC (hub.efnet.us irc.Prison.NET)
[07:07] *** maxfan8 has quit IRC (hub.efnet.us irc.Prison.NET)
[07:07] *** scorche has quit IRC (hub.efnet.us irc.Prison.NET)
[07:10] *** godane has joined #archiveteam-bs
[07:10] *** phirephly has joined #archiveteam-bs
[07:10] *** Pixi` has joined #archiveteam-bs
[07:10] *** maxfan8 has joined #archiveteam-bs
[07:10] *** scorche has joined #archiveteam-bs
[07:10] *** jshoard has joined #archiveteam-bs
[07:11] *** superkuh has joined #archiveteam-bs
[07:21] *** bsmith093 has quit IRC (Read error: Operation timed out)
[07:30] *** bsmith093 has joined #archiveteam-bs
[07:36] *** HP_Archiv has joined #archiveteam-bs
[08:01] *** lennier1 has quit IRC (Ping timeout: 857 seconds)
[08:03] *** HP_Archiv has quit IRC (Quit: Leaving)
[09:08] *** BlueMax has quit IRC (Read error: Connection reset by peer)
[09:57] *** kiskaWee has joined #archiveteam-bs
[10:34] *** kiska has quit IRC (Remote host closed the connection)
[10:36] *** kiska has joined #archiveteam-bs
[12:41] *** fredgido has joined #archiveteam-bs
[12:48] *** fredgido_ has quit IRC (Read error: Operation timed out)
[13:47] *** Terbium has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
[13:48] *** Terbium has joined #archiveteam-bs
[15:25] *** Arcorann has quit IRC (Read error: Connection reset by peer)
[15:43] *** systwi has quit IRC (Read error: Operation timed out)
[16:03] *** Nikchemny has joined #archiveteam-bs
[16:03] nico_32_ How can I recognize that you have made progress? Check your IA account?
[16:04] *** Nikchemny has quit IRC (Client Quit)
[16:10] Oh, that's "http://fileformats.archiveteam.org/ is down", not "http://fileformats.archiveteam.org/ is shutting down"
[16:13] seems to be back up
[16:14] :P
[16:30] *** systwi has joined #archiveteam-bs
[16:30] *** Maylay has quit IRC (Read error: Connection reset by peer)
[16:34] *** Maylay has joined #archiveteam-bs
[16:49] WE THANK YOU ALL FOR OUR INCREDIBLE FILE FORMAT JOURNEY
[16:52] all file formats will be .gif, problem solved
[18:23] *** Nikchemny has joined #archiveteam-bs
[18:26] JAA: Do you have Telegram? There is a bot ( t.me/SaveYoutubeBot ) that can save YT videos. I asked them whether they will upload videos to IA; they said they don't have plans for this. I asked whether they will and they didn't answer. Maybe someone from AT or even IA could contact them ( t.me/SYmanagerBot )?
[18:26] *will they create plans for this
[18:31] *** Mateon1 has quit IRC (Ping timeout: 265 seconds)
[18:31] *** Mateon1 has joined #archiveteam-bs
[18:32] *** lennier1 has joined #archiveteam-bs
[18:33] no.
[18:33] this has caused issues in the past, because people save the most mundane shit and it's just a waste of space
[18:34] IA doesn't want 500x copies of gangnam style or whatever people listen to these days
[18:34] Kaz: Hm, I think that IA can think before uploading data
[18:35] what
[18:35] Btw, looks like https://youtube.com/watch?v=ddDpdfXlMxY was saved in 360p and 720p
[18:35] And it is not pop-music
[18:36] you've lost me
[18:36] It is about capitalism, Putin, Russia
[18:36] when did we move on to russia
[18:37] is this like when you go on wikipedia and see how many clicks until you find hitler
[18:37] I mean, IA (in my dream) won't upload all the videos by SaveYoutubeBot, only those that look important
[18:37] Kaz: https://youtube.com/watch?v=ddDpdfXlMxY is an example of a saved video
[18:38] uh
[18:38] whoever runs the bot would upload to IA
[18:38] except, they shouldn't because it's a terrible idea
[18:38] PUTIN'S CAPITALISM. How Putin "saved" Russia from the 90s → (https://youtube.com/watch?v=ddDpdfXlMxY) 👤 #Вестник_Бури → (http://www.youtube.com/channel/UCQ_LYRUJzBfh-mvU14xCNMw) 🚀 360p: 117MB 🚀 720p: 324MB ✅ 1080p: 658MB Formats for download ↓
[18:38] 🚀 means video was saved
[18:38] ok(??)
[18:39] ✅ means video can be saved, but has not been saved by anyone yet
[18:39] Yeah, no.
[18:39] *** JAA sets mode: +o Kaz
[18:39] as long as they save it somewhere that isn't IA, this all sounds fine
[18:39] Why?
[18:40] https://usercontent.irccloud-cdn.com/file/4ph8PMSB/image.png
[18:40] That and also IA pays $1500 per TB of data saved to be accessed forever
[18:41] I'd rather my donation be saving something more significant than youtube videos
[18:41] Yes, I mean IA can choose what to upload. The idea was to share this data with IA
[18:41] that's not how uploading to IA works
[18:41] it's user-submitted
[18:42] https://usercontent.irccloud-cdn.com/file/4ph8PMSB/image.png Hm, interesting argument. But there are many community books, videos etc
[18:42] yes
[18:42] books small
[18:42] Hm, ok. No is no
[18:42] youtube videos of putin big
[18:43] and bad
[18:43] *** JAA sets mode: +o Kaz
[18:43] *** JAA sets mode: +o kiska
[18:45] Also, uploading content that is still available on YouTube is bad. It wastes disk space that could be used for something more useful.
[18:46] Mhm, I thought that the whole IA is for saving content in case of its death.
[18:46] Except really old books and films
[18:48] Yeah, but IA doesn't have the resources (money) to archive YouTube. It's just too big.
[18:49] Btw, does anyone moderate community collections? Looks like there are tons of useless pics (like Twitter and YT icons), sometimes porn etc
[18:50] yes
[18:50] Ok
[18:58] yeah, there is a lot of useless shit in IA collections
[19:00] sometimes I explore those collections randomly, and half of it is religious content, and other stuff
[19:00] Looks like content by Arabians
[19:01] *arabs
[19:01] well i have seen evangelist content too
[19:02] Hm
[19:02] Also, IA has a collection with the Bible in almost all languages
[19:02] all of them have uploaded hours and hours and hours of videos analysing their "holy" books
[19:03] The books are holy for them. We must respect their religions
[19:04] Even atheists must
[19:05] *** Nikchemny has quit IRC (Quit: Page closed)
[19:07] *** Nikchemny has joined #archiveteam-bs
[19:08] Btw, it would be great if IA created a collection for YT vids whose links are on Wikipedia
[19:15] *** godane has quit IRC (Read error: Connection reset by peer)
[19:18] *** godane has joined #archiveteam-bs
[19:22] FYI, http://fileformats.archiveteam.org/ is down again. Same 'Unable to allocate memory for pool' errors as before.
[19:23] < Nikchemny> Btw, does anyone moderate community collections? Looks like there are tons of useless pics (like Twitter and YT icons), sometimes porn etc
[19:23] Why.... is this in #archiveteam-bs
[19:24] Sorry, no more
[19:25] By the way, this is literally my job
[19:25] Literally. My. Job.
[19:25] Like, I'm that guy.
[19:25] Ah, okay. I didn't know that. Sorry
[19:26] What, you did it
[19:26] You got into the Tyrell Corporation. I'm the guy
[19:29] Btw, thanks for uploading old journals. I have a small collection of journals about cars from 2013, but I don't think you will be interested in it.
[19:36] *** balrog has quit IRC (Remote host closed the connection)
[19:37] *** Nikchemny has quit IRC (Quit: Page closed)
[19:39] *** balrog has joined #archiveteam-bs
[20:19] *** lennier1 has quit IRC (Ping timeout: 260 seconds)
[20:21] *** lennier1 has joined #archiveteam-bs
[20:30] *** VoynichCr has left
[20:31] *** DogsRNice has joined #archiveteam-bs
[20:32] *** lennier2 has joined #archiveteam-bs
[20:39] *** antomati_ is now known as antomatic
[20:40] *** lennier1 has quit IRC (Read error: Operation timed out)
[20:40] *** lennier2 is now known as lennier1
[22:48] *** Arcorann has joined #archiveteam-bs
[22:49] *** Arcorann has quit IRC (Remote host closed the connection)
[22:50] *** Arcorann has joined #archiveteam-bs
[22:54] *** Nikchemny has joined #archiveteam-bs
[22:57] It would be nice to see korobkov here
[22:58] *** Nikchemny has quit IRC (Client Quit)
[23:07] Peerlyst is basically all JavaScript, which is really annoying
[23:10] *** jshoard has quit IRC (Leaving)
[23:30] Not just "basically", entirely except for robots.txt and sitemap, as far as I can see. Eww
[23:37] With the explosion of client-side rendered sites, I am not surprised in the least
[23:42] *** HP_Archiv has joined #archiveteam-bs
[23:50] *** britmob has quit IRC (Read error: Connection reset by peer)
[23:53] *** britmob has joined #archiveteam-bs