[00:26] *** ephemer0l has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [00:31] *** ephemer0l has joined #archiveteam-ot [01:52] *** killsushi has joined #archiveteam-ot [03:05] *** Maylay has quit IRC (Pipe Terminated) [03:07] *** Maylay has joined #archiveteam-ot [03:07] *** Maylay has quit IRC (Remote host closed the connection!) [03:08] *** Maylay has joined #archiveteam-ot [03:58] *** qw3rty119 has joined #archiveteam-ot [04:04] *** qw3rty118 has quit IRC (Read error: Operation timed out) [04:59] *** godane1 has joined #archiveteam-ot [05:00] *** godane has quit IRC (Read error: Operation timed out) [06:03] *** m007a83 has quit IRC (Read error: Operation timed out) [06:07] *** Dimtree has joined #archiveteam-ot [07:39] Should I create/collect ZIM files of wikis (fandom and mediawikis) or would a WARC format be better? I would like to collect edit history, sources, talk pages, user pages, etc. (and of course the actual content) [07:44] if its a small wiki then archivebot could work with ignores. Best bet is wikiteam but I dont know what kind of stuff they actually grab I think its just the XML stuff mainly [07:44] Oh I was thinking for personal use... [07:45] But yeah of course archivebot as well [07:48] I am not sure then honestly. Archivebot would only be good for small wikis if you want to collect edit history and sources and all the other stuff you want to grab cause that stuff AFAIK grabs strangely and ends up being ignored cause it goes everywhere [07:58] *** m007a83 has joined #archiveteam-ot [09:45] *** VerifiedJ has joined #archiveteam-ot [10:08] https://www.zdnet.com/article/hackers-breach-fsb-contractor-expose-tor-deanonymization-project/ Are we allowed to backup a copy of these files? [10:17] systwi: ZIM files are probably most efficient and usable for personal use. WARCs will be much larger, and the XML dumps can't be browsed directly I think. A tool to do so would be incredibly useful though. [11:00] *** BlueMax has quit IRC (Quit: Leaving) [12:18] *** killsushi has quit IRC (Quit: Leaving) [14:36] archive all the formats! [15:10] *** Hani has quit IRC (Quit: Hani) [15:17] *** Verified_ has quit IRC (Ping timeout: 252 seconds) [15:55] SketchCow , Igloo: ping, those ISP cds are ending in around 5 minutes [15:55] the lot of 115 unique ones has already ended at 0 bids [16:06] *** schbirid has joined #archiveteam-ot [16:38] *** Hani has joined #archiveteam-ot [17:16] *** DogsRNice has joined #archiveteam-ot [17:50] Is there any known name for the month-year format of: 7'19 [18:12] JAA: I'm just unsure if ZIM creation tools can save everything I need. I thought they only save the page and (sometimes) images/audio/video. I'm looking to make the grabs as complete as possible. [18:15] *** Joseph_ has joined #archiveteam-ot [18:15] *** VerifiedJ has quit IRC (Read error: Connection reset by peer) [18:33] *** Leslie has quit IRC (Read error: Operation timed out) [18:33] "* 'Reward' - a project to covertly penetrate P2P networks like the one used for torrents." -- are BTT (bittorrent coin) a russian invention? [18:41] *** Dallas has joined #archiveteam-ot [18:50] *** hi has joined #archiveteam-ot [19:07] *** Ryz has joined #archiveteam-ot [19:08] On my daily hunt for finding company acquisitions, I found https://cointelegraph.com/ - which came from https://cointelegraph.com/news/fidelitys-crypto-branch-files-for-a-new-york-trust-license-report while searching for 'acquire' [19:08] ...These illustrations are pretty awesome~ x3 [19:09] I'm reminded of those YouTube thumbnails where they hire artists to drew these thumbnails [19:11] I am actually quite afraid of 2 future projects, #shreddit and #sketchedout. Cause there is a certain someone who can(will) spin up many instances and slam the rsync targets with traffic(to the point where it can't send data out fast enough) [19:11] I have 3gbps of capacity right now, and I am working on increasing that because of that possibility that we can't ingest fast enough [19:45] *** VerifiedJ has joined #archiveteam-ot [19:45] *** Joseph_ has quit IRC (Read error: Connection reset by peer) [19:46] *** Ryz has quit IRC (Quit: ChatZilla 0.9.92-rdmsoft [XULRunner 35.0.1/20150122214805]) [19:46] *** VerifiedJ has quit IRC (Read error: Connection reset by peer) [19:47] *** VerifiedJ has joined #archiveteam-ot [19:50] *** Ravenloft has joined #archiveteam-ot [19:57] *** DogsRNice has quit IRC (Ping timeout: 252 seconds) [19:58] *** Dj-Wawa has joined #archiveteam-ot [19:59] *** DogsRNice has joined #archiveteam-ot [20:07] *** hi has quit IRC (Quit: Page closed) [20:09] *** Leslie has joined #archiveteam-ot [20:19] *** DogsRNice has quit IRC (Ping timeout: 252 seconds) [20:32] systwi: For complete archives, the wikiteam tools are probably best. Those just aren't very accessible, as mentioned earlier. [20:38] *** schbirid has quit IRC (Remote host closed the connection) [20:39] *** DogsRNice has joined #archiveteam-ot [20:57] kiska: who is that certain someone? [20:58] Fusl: ^ [20:58] xD [20:59] let's hope those projects arent going to be rape limited [20:59] is #sketchedout the day SketchCow drops off the Internet and everyone scrambles to archive his shit [20:59] *** Ravenloft has quit IRC (Remote host closed the connection) [20:59] Oh we might have to do a server rate limit to keep our rsync targets from exploding [20:59] :D [21:00] Also the tracker. [21:00] the rsync targets /are/ the rate limiting :D [21:00] :P [21:00] thisisfine.jpg [21:01] And yeah, the targets automatically limit the rate, adjusting the rate on the tracker isn't needed for that. [21:01] I am pretty sure during #googleminus I was bringing up new servers every 6 hours [21:02] I also made a small spelling mistake in the rsyncd.conf that allowed >2k connections to one of the targets xD [21:02] Which promptly crashed the host node [21:42] *** VerifiedJ has quit IRC (Read error: Connection reset by peer) [21:42] *** VerifiedJ has joined #archiveteam-ot [22:09] kiska: you made a spelling mistake which allowed unlimited amount of connections [22:09] Yes xD [22:11] That was fun trying to debug, but xD [22:12] "why does the load average go over 9000 when i enable this rsync target?" - "yes." [22:13] XD [22:16] Psh it was only over 50 xD [22:28] *** Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) [22:29] *** Dj-Wawa has joined #archiveteam-ot [22:36] *** dashcloud has quit IRC (Remote host closed the connection) [22:37] *** dashcloud has joined #archiveteam-ot [22:56] TIL that people have misread my (full) nickname as Just Another Anarchist and Just Another Antichrist. :-) [23:10] You arent an anarchist and/or the antichrist?!?!?!?!?! JAA I HAVE BEEN LIED TO [23:19] https://techraptor.net/content/armor-games-data-breach-january-2019 LETS STORE THE SALTS NEXT TO THE PASSWORDSS [23:22] *** BlueMax has joined #archiveteam-ot [23:32] *** benjinsmi has joined #archiveteam-ot [23:33] *** benjins has quit IRC (Ping timeout: 252 seconds) [23:42] *** benjins has joined #archiveteam-ot [23:43] *** benjinsmi has quit IRC (Ping timeout: 604 seconds) [23:47] *** VerifiedJ has quit IRC (Read error: Operation timed out)