#archiveteam-ot 2019-07-21,Sun

↑back Search

Time Nickname Message
00:26 🔗 ephemer0l has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
00:31 🔗 ephemer0l has joined #archiveteam-ot
01:52 🔗 killsushi has joined #archiveteam-ot
03:05 🔗 Maylay has quit IRC (Pipe Terminated)
03:07 🔗 Maylay has joined #archiveteam-ot
03:07 🔗 Maylay has quit IRC (Remote host closed the connection!)
03:08 🔗 Maylay has joined #archiveteam-ot
03:58 🔗 qw3rty119 has joined #archiveteam-ot
04:04 🔗 qw3rty118 has quit IRC (Read error: Operation timed out)
04:59 🔗 godane1 has joined #archiveteam-ot
05:00 🔗 godane has quit IRC (Read error: Operation timed out)
06:03 🔗 m007a83 has quit IRC (Read error: Operation timed out)
06:07 🔗 Dimtree has joined #archiveteam-ot
07:39 🔗 systwi Should I create/collect ZIM files of wikis (fandom and mediawikis) or would a WARC format be better? I would like to collect edit history, sources, talk pages, user pages, etc. (and of course the actual content)
07:44 🔗 Flashfire if its a small wiki then archivebot could work with ignores. Best bet is wikiteam but I dont know what kind of stuff they actually grab I think its just the XML stuff mainly
07:44 🔗 systwi Oh I was thinking for personal use...
07:45 🔗 systwi But yeah of course archivebot as well
07:48 🔗 Flashfire I am not sure then honestly. Archivebot would only be good for small wikis if you want to collect edit history and sources and all the other stuff you want to grab cause that stuff AFAIK grabs strangely and ends up being ignored cause it goes everywhere
07:58 🔗 m007a83 has joined #archiveteam-ot
09:45 🔗 VerifiedJ has joined #archiveteam-ot
10:08 🔗 Flashfire https://www.zdnet.com/article/hackers-breach-fsb-contractor-expose-tor-deanonymization-project/ Are we allowed to backup a copy of these files?
10:17 🔗 JAA systwi: ZIM files are probably most efficient and usable for personal use. WARCs will be much larger, and the XML dumps can't be browsed directly I think. A tool to do so would be incredibly useful though.
11:00 🔗 BlueMax has quit IRC (Quit: Leaving)
12:18 🔗 killsushi has quit IRC (Quit: Leaving)
14:36 🔗 Somebody2 archive all the formats!
15:10 🔗 Hani has quit IRC (Quit: Hani)
15:17 🔗 Verified_ has quit IRC (Ping timeout: 252 seconds)
15:55 🔗 betamax SketchCow , Igloo: ping, those ISP cds are ending in around 5 minutes
15:55 🔗 betamax the lot of 115 unique ones has already ended at 0 bids
16:06 🔗 schbirid has joined #archiveteam-ot
16:38 🔗 Hani has joined #archiveteam-ot
17:16 🔗 DogsRNice has joined #archiveteam-ot
17:50 🔗 Raccoon Is there any known name for the month-year format of: 7'19
18:12 🔗 systwi JAA: I'm just unsure if ZIM creation tools can save everything I need. I thought they only save the page and (sometimes) images/audio/video. I'm looking to make the grabs as complete as possible.
18:15 🔗 Joseph_ has joined #archiveteam-ot
18:15 🔗 VerifiedJ has quit IRC (Read error: Connection reset by peer)
18:33 🔗 Leslie has quit IRC (Read error: Operation timed out)
18:33 🔗 Raccoon "* 'Reward' - a project to covertly penetrate P2P networks like the one used for torrents." -- are BTT (bittorrent coin) a russian invention?
18:41 🔗 Dallas has joined #archiveteam-ot
18:50 🔗 hi has joined #archiveteam-ot
19:07 🔗 Ryz has joined #archiveteam-ot
19:08 🔗 Ryz On my daily hunt for finding company acquisitions, I found https://cointelegraph.com/ - which came from https://cointelegraph.com/news/fidelitys-crypto-branch-files-for-a-new-york-trust-license-report while searching for 'acquire'
19:08 🔗 Ryz ...These illustrations are pretty awesome~ x3
19:09 🔗 Ryz I'm reminded of those YouTube thumbnails where they hire artists to drew these thumbnails
19:11 🔗 kiska I am actually quite afraid of 2 future projects, #shreddit and #sketchedout. Cause there is a certain someone who can(will) spin up many instances and slam the rsync targets with traffic(to the point where it can't send data out fast enough)
19:11 🔗 kiska I have 3gbps of capacity right now, and I am working on increasing that because of that possibility that we can't ingest fast enough
19:45 🔗 VerifiedJ has joined #archiveteam-ot
19:45 🔗 Joseph_ has quit IRC (Read error: Connection reset by peer)
19:46 🔗 Ryz has quit IRC (Quit: ChatZilla 0.9.92-rdmsoft [XULRunner 35.0.1/20150122214805])
19:46 🔗 VerifiedJ has quit IRC (Read error: Connection reset by peer)
19:47 🔗 VerifiedJ has joined #archiveteam-ot
19:50 🔗 Ravenloft has joined #archiveteam-ot
19:57 🔗 DogsRNice has quit IRC (Ping timeout: 252 seconds)
19:58 🔗 Dj-Wawa has joined #archiveteam-ot
19:59 🔗 DogsRNice has joined #archiveteam-ot
20:07 🔗 hi has quit IRC (Quit: Page closed)
20:09 🔗 Leslie has joined #archiveteam-ot
20:19 🔗 DogsRNice has quit IRC (Ping timeout: 252 seconds)
20:32 🔗 JAA systwi: For complete archives, the wikiteam tools are probably best. Those just aren't very accessible, as mentioned earlier.
20:38 🔗 schbirid has quit IRC (Remote host closed the connection)
20:39 🔗 DogsRNice has joined #archiveteam-ot
20:57 🔗 Fusl kiska: who is that certain someone?
20:58 🔗 kiska Fusl: ^
20:58 🔗 kiska xD
20:59 🔗 Fusl let's hope those projects arent going to be rape limited
20:59 🔗 Raccoon is #sketchedout the day SketchCow drops off the Internet and everyone scrambles to archive his shit
20:59 🔗 Ravenloft has quit IRC (Remote host closed the connection)
20:59 🔗 kiska Oh we might have to do a server rate limit to keep our rsync targets from exploding
20:59 🔗 Fusl :D
21:00 🔗 JAA Also the tracker.
21:00 🔗 Fusl the rsync targets /are/ the rate limiting :D
21:00 🔗 kiska :P
21:00 🔗 JAA thisisfine.jpg
21:01 🔗 JAA And yeah, the targets automatically limit the rate, adjusting the rate on the tracker isn't needed for that.
21:01 🔗 kiska I am pretty sure during #googleminus I was bringing up new servers every 6 hours
21:02 🔗 kiska I also made a small spelling mistake in the rsyncd.conf that allowed >2k connections to one of the targets xD
21:02 🔗 kiska Which promptly crashed the host node
21:42 🔗 VerifiedJ has quit IRC (Read error: Connection reset by peer)
21:42 🔗 VerifiedJ has joined #archiveteam-ot
22:09 🔗 Fusl kiska: you made a spelling mistake which allowed unlimited amount of connections
22:09 🔗 kiska Yes xD
22:11 🔗 kiska That was fun trying to debug, but xD
22:12 🔗 Fusl "why does the load average go over 9000 when i enable this rsync target?" - "yes."
22:13 🔗 kiska XD
22:16 🔗 kiska Psh it was only over 50 xD
22:28 🔗 Dj-Wawa has quit IRC (Quit: Connection closed for inactivity)
22:29 🔗 Dj-Wawa has joined #archiveteam-ot
22:36 🔗 dashcloud has quit IRC (Remote host closed the connection)
22:37 🔗 dashcloud has joined #archiveteam-ot
22:56 🔗 JAA TIL that people have misread my (full) nickname as Just Another Anarchist and Just Another Antichrist. :-)
23:10 🔗 Flashfire You arent an anarchist and/or the antichrist?!?!?!?!?! JAA I HAVE BEEN LIED TO
23:19 🔗 Flashfire https://techraptor.net/content/armor-games-data-breach-january-2019 LETS STORE THE SALTS NEXT TO THE PASSWORDSS
23:22 🔗 BlueMax has joined #archiveteam-ot
23:32 🔗 benjinsmi has joined #archiveteam-ot
23:33 🔗 benjins has quit IRC (Ping timeout: 252 seconds)
23:42 🔗 benjins has joined #archiveteam-ot
23:43 🔗 benjinsmi has quit IRC (Ping timeout: 604 seconds)
23:47 🔗 VerifiedJ has quit IRC (Read error: Operation timed out)

irclogger-viewer