#archiveteam-bs 2020-07-19,Sun

↑back Search

Time Nickname Message
00:00 πŸ”— closure_ has quit IRC (Read error: Operation timed out)
00:00 πŸ”— closure has joined #archiveteam-bs
00:01 πŸ”— Kaz has quit IRC (Read error: Operation timed out)
00:01 πŸ”— kyledrake has quit IRC (Read error: Operation timed out)
00:01 πŸ”— phuzion has quit IRC (Read error: Operation timed out)
00:01 πŸ”— second_ has quit IRC (Read error: Operation timed out)
00:02 πŸ”— kyledrake has joined #archiveteam-bs
00:02 πŸ”— Kaz has joined #archiveteam-bs
00:03 πŸ”— atomicthu has quit IRC (Read error: Operation timed out)
00:03 πŸ”— second has joined #archiveteam-bs
00:04 πŸ”— phuzion has joined #archiveteam-bs
00:04 πŸ”— Ryz has quit IRC (Ping timeout: 496 seconds)
00:04 πŸ”— pikami_ has quit IRC (Ping timeout: 496 seconds)
00:05 πŸ”— atomicthu has joined #archiveteam-bs
00:05 πŸ”— Larsenv has quit IRC (Read error: Operation timed out)
00:05 πŸ”— Larsenv has joined #archiveteam-bs
00:05 πŸ”— svchfoo3 has quit IRC (Ping timeout: 496 seconds)
00:05 πŸ”— pikami has joined #archiveteam-bs
00:06 πŸ”— Raccoon has joined #archiveteam-bs
00:08 πŸ”— antomati_ has joined #archiveteam-bs
00:08 πŸ”— Datechnom has quit IRC (Ping timeout: 496 seconds)
00:10 πŸ”— Hooloovoo has quit IRC (Read error: Operation timed out)
00:10 πŸ”— Hooloovoo has joined #archiveteam-bs
00:14 πŸ”— Jonimoose has quit IRC (Ping timeout: 496 seconds)
00:15 πŸ”— antomatic has quit IRC (Ping timeout: 496 seconds)
00:19 πŸ”— Jonimoose has joined #archiveteam-bs
00:47 πŸ”— SynMonger has quit IRC (Quit: Wait, what?)
00:51 πŸ”— SynMonger has joined #archiveteam-bs
01:08 πŸ”— svchfoo3 has joined #archiveteam-bs
01:08 πŸ”— svchfoo1 sets mode: +o svchfoo3
01:08 πŸ”— Datechnom has joined #archiveteam-bs
01:21 πŸ”— Ryz has joined #archiveteam-bs
01:27 πŸ”— PovAddict has joined #archiveteam-bs
03:49 πŸ”— qw3rty_ has joined #archiveteam-bs
03:57 πŸ”— qw3rty has quit IRC (Read error: Operation timed out)
04:23 πŸ”— DogsRNice has quit IRC (Read error: Connection reset by peer)
04:44 πŸ”— prq has quit IRC (Read error: Connection reset by peer)
04:54 πŸ”— Stiletto has joined #archiveteam-bs
04:58 πŸ”— Stilett0 has quit IRC (Ping timeout: 272 seconds)
05:02 πŸ”— SketchCow We saved and returned upcoming.org.
05:03 πŸ”— SketchCow And it was bought back from Yahoo! and put back up
05:03 πŸ”— SketchCow The business has been not in great shape, but he did do it.
05:03 πŸ”— Ctrl has quit IRC (Read error: Operation timed out)
05:06 πŸ”— duh has joined #archiveteam-bs
05:06 πŸ”— Stilett0 has joined #archiveteam-bs
05:08 πŸ”— Stiletto has quit IRC (Ping timeout: 260 seconds)
05:10 πŸ”— BnAboyZ has quit IRC (Ping timeout: 857 seconds)
05:13 πŸ”— legoktm has quit IRC (Read error: Connection reset by peer)
05:14 πŸ”— antomatic has joined #archiveteam-bs
05:15 πŸ”— BnAboyZ has joined #archiveteam-bs
05:16 πŸ”— SJon___ has quit IRC (Ping timeout: 857 seconds)
05:21 πŸ”— SJon___ has joined #archiveteam-bs
05:25 πŸ”— antomati_ has quit IRC (Read error: Operation timed out)
05:55 πŸ”— nuc has joined #archiveteam-bs
05:55 πŸ”— nuc is now known as Somebody2
05:57 πŸ”— Ctrl has joined #archiveteam-bs
06:51 πŸ”— PovAddict has quit IRC (Quit: Konversation terminated!)
07:56 πŸ”— WalkFly has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
10:07 πŸ”— VADemon has joined #archiveteam-bs
10:24 πŸ”— VADemon has quit IRC (left4dead)
10:35 πŸ”— BeefyBoot has quit IRC (Quit: Connection closed for inactivity)
12:01 πŸ”— BlueMax has quit IRC (Quit: Leaving)
12:09 πŸ”— jshoard has joined #archiveteam-bs
12:37 πŸ”— Stiletto has joined #archiveteam-bs
12:37 πŸ”— Stilett0 has quit IRC (Ping timeout: 260 seconds)
12:39 πŸ”— yano has quit IRC (Quit: WeeChat, The Better IRC Client, https://weechat.org/)
12:39 πŸ”— yano has joined #archiveteam-bs
12:46 πŸ”— Raccoon has quit IRC (Ping timeout: 265 seconds)
14:11 πŸ”— Mateon1 has quit IRC (Remote host closed the connection)
14:11 πŸ”— Mateon1 has joined #archiveteam-bs
14:42 πŸ”— lennier1 has quit IRC (Read error: No route to host)
14:43 πŸ”— lennier1 has joined #archiveteam-bs
15:56 πŸ”— DogsRNice has joined #archiveteam-bs
16:14 πŸ”— Debiloid has joined #archiveteam-bs
16:15 πŸ”— Debiloid Π­Ρ‰ΠΊΠ΅Ρ€Π΅
16:15 πŸ”— drcd_ has joined #archiveteam-bs
16:15 πŸ”— Debiloid Π«Ρ‹Ρ‹
16:16 πŸ”— Debiloid Ваня Варя А Аня
16:17 πŸ”— JAA Debiloid: How can we help you?
16:17 πŸ”— vanek_shn has joined #archiveteam-bs
16:17 πŸ”— vanek_shn Π³Π΄Π΅ я
16:18 πŸ”— Debiloid Π’Ρ‹ здСсь
16:18 πŸ”— vanek_shn ΠΎ Π΄ΠΈΠ±ΠΈΠ»Π° Π·Π΄Π°Ρ€ΠΎΠ²Π°
16:18 πŸ”— vanek_shn ΠΊΠ°Π²ΠΎ
16:19 πŸ”— Debiloid vanek_shn There people uses english
16:19 πŸ”— vanek_shn кстати
16:19 πŸ”— vanek_shn Π° ΠΌΠ½Π΅ Π½Π°ΡΡ€Π°Ρ‚ΡŒ Π½Π° инглиш
16:19 πŸ”— Debiloid Jaa do you save archive.org.ua?
16:19 πŸ”— vanek_shn ΠΊΠΎΡ€ΠΎΡ‡Π΅ саня
16:19 πŸ”— Debiloid Ukrainin site
16:19 πŸ”— vanek_shn скачай ΠΈΠ³Ρ€Ρƒ ΠΌΠΎΡ€Ρ…ΡƒΡ…Π½ Π»Π΅Π³Π΅Π½Π΄Ρ‹ ΠΊΠ°Ρ€Ρ‚ΠΈΠ½Π³Π°
16:21 πŸ”— Debiloid Jaa, it is like Web-Archive but isnot open
16:21 πŸ”— JAA Debiloid: Interesting website, thank you. I have never seen it before. We do not currently save it. Is it at risk?
16:21 πŸ”— drcd has quit IRC (Ping timeout: 745 seconds)
16:21 πŸ”— drcd_ is now known as drcd
16:21 πŸ”— vanek_shn Π·Π°Ρ‚ΠΊΠ½ΠΈΡΡŒ пСндосина
16:21 πŸ”— vanek_shn ΠΌΠΎΡ€Ρ…ΡƒΡ…Π½
16:22 πŸ”— vanek_shn саня ΠΊΠ°ΠΊ Π½ΠΈΠΊ ΠΏΠΎΠΌΠ΅Π½ΡΡ‚ΡŒ
16:22 πŸ”— Debiloid Jaa some pages are save very badly. vanek_shn later
16:22 πŸ”— vanek_shn jaa Ρ‡ΠΌΠΎ с малСнькой письькой
16:24 πŸ”— vanek_shn сука ΠΌΠΎΡ€Ρ…ΡƒΡ…Π½ Π½Π΅ качаСтся
16:25 πŸ”— Debiloid Jaa also there were pictires but now they are not
16:27 πŸ”— vanek_shn i love fuck cats
16:27 πŸ”— vanek_shn and dogs too
16:27 πŸ”— vanek_shn and JAA
16:27 πŸ”— JAA sets mode: +b *!*4dde6317@*.mibbit.com
16:27 πŸ”— vanek_shn was kicked by JAA (vanek_shn)
16:27 πŸ”— Debiloid Jaa looks like they try to clear their site
16:28 πŸ”— JAA Debiloid: Do you know who operates it?
16:29 πŸ”— Debiloid Jaa there was email
16:29 πŸ”— Debiloid Maybe just send him hello and ask him about pages and everything
16:30 πŸ”— JAA Ah, yes, found it.
16:31 πŸ”— Debiloid If he or she love web-archive he or she can work with you
16:31 πŸ”— JAA It looks like the Internet Archive covered a part of the website in 2015.
16:31 πŸ”— Debiloid With what tool?
16:31 πŸ”— Debiloid By users?
16:32 πŸ”— JAA Their internal crawling tool, Heritrix.
16:32 πŸ”— JAA But that is almost certainly very incomplete.
16:33 πŸ”— Debiloid But 2015 was 5 years ago
16:33 πŸ”— JAA Yeah
16:33 πŸ”— Debiloid There can be new pages
16:33 πŸ”— Debiloid Now Ukraine has new president
16:35 πŸ”— Debiloid Btw, it's just text. No links. Jaa it's not like Web-archive links dont work
16:38 πŸ”— Debiloid Jaa when archive team will ask creator? Some pages looks like shit and not like originals. But older pages are not
16:38 πŸ”— JAA Yes, I saw that. Makes it easier to archive.
16:38 πŸ”— JAA I will look into it in more detail soon.
16:39 πŸ”— Debiloid Its like another collection on web archive fir this site? Like for google plus?
16:39 πŸ”— Debiloid Will there be collection
16:40 πŸ”— JAA Maybe. It depends on how large it is in total.
16:40 πŸ”— JAA I see numbers of 30 and 18 million pages mentioned on the website.
16:41 πŸ”— JAA But if that is all text, it would not be very large. So maybe just one item on Internet Archive.
16:41 πŸ”— Debiloid So will there be collections for webcitation and other?
16:41 πŸ”— Debiloid It will be like archive.org/details/archive.org.ua?
16:42 πŸ”— JAA Something similar to that, yes.
16:42 πŸ”— JAA It will also be in the Wayback Machine at web.archive.org.
16:43 πŸ”— Debiloid Similar? But like what?
16:43 πŸ”— JAA I do not know yet. It depends on the size, when we download it, etc.
16:44 πŸ”— JAA You will see it on https://web.archive.org/web/*/https://archive.org.ua/ when it is done.
16:44 πŸ”— Arcorann has quit IRC (Read error: Connection reset by peer)
16:45 πŸ”— Debiloid But is it too big? Hee are archive-bots for nit very big sites
16:45 πŸ”— JAA I do not know how big it is. Have to download it first.
16:46 πŸ”— Debiloid Okey
16:46 πŸ”— nico_32_ hum https://archive.org.ua/robots.txt
16:47 πŸ”— nico_32_ the current robots.txt ban the index page
16:48 πŸ”— JAA No, it is ineffective.
16:48 πŸ”— JAA It's /search/example.org/, no query.
16:49 πŸ”— Debiloid Jaa maybe just archive-bots? But this archive will not be in its collection
16:49 πŸ”— JAA Maybe /search/?... was a previous website structure. The site worked differently in 2017.
16:49 πŸ”— nico_32_ https://archive.org.ua/s/?s=013.kiev.ua&d=2006-12
16:49 πŸ”— JAA nico_32_: /s/, not /search/ :-)
16:49 πŸ”— Debiloid It will be in archive-bot collection?
16:49 πŸ”— nico_32_ yes
16:49 πŸ”— JAA Debiloid: It is too big for ArchiveBot.
16:50 πŸ”— Debiloid Okey
16:50 πŸ”— nico_32_ i was probably a /search/? => /s/?
16:50 πŸ”— nico_32_ refactor
16:50 πŸ”— JAA Yeah, could be.
16:50 πŸ”— nico_32_ as the collection grown
16:50 πŸ”— JAA Last page of the domain index is https://archive.org.ua/list/?&offset=464600 by the way.
16:51 πŸ”— Debiloid It is like archive of archive
16:55 πŸ”— Debiloid Jaa why https://community.arm.com/ isnt too big when it have abot 25 mln pages but archive.org.ua is too big?
16:55 πŸ”— Debiloid When it have about 18 mln pages you said
16:59 πŸ”— Debiloid A
16:59 πŸ”— Debiloid Jaa?
17:01 πŸ”— Debiloid Ff
17:03 πŸ”— JAA Debiloid: community.arm.com is also too big for ArchiveBot, but I did not know that when I started it.
17:03 πŸ”— JAA It still retrieves data, but it gets slow when a job is more than a few million URLs.
17:05 πŸ”— Debiloid Lol
17:06 πŸ”— Debiloid Matbe save witt many parts
17:06 πŸ”— Debiloid Jaa
17:06 πŸ”— JAA Don't worry, I will find a solution.
17:09 πŸ”— Debiloid has quit IRC (Quit: https://mibbit.com Online IRC Client)
17:14 πŸ”— Mateon1 has quit IRC (Ping timeout: 272 seconds)
17:15 πŸ”— Mateon1 has joined #archiveteam-bs
17:32 πŸ”— JAA I'll throw the TechNet forums, wikis, and (already migrated and "archived") blogs into AB. It appears we never really properly covered those, and while Microsoft said they'll keep it read-only, we all know how that will go.
17:35 πŸ”— JAA By the way, here's an overview of the steps: https://docs.microsoft.com/en-us/teamblog/msdn-technet-migration
17:36 πŸ”— JAA Already outdated as it's from late last year. Gallery was supposed to disappear in June but is still online.
17:43 πŸ”— Debiloid has joined #archiveteam-bs
17:43 πŸ”— Debiloid A
17:44 πŸ”— Debiloid Jaa it also saved something related to google https://archive.org.ua/search/?s=Google
17:45 πŸ”— Debiloid There is also lists of archives https://archive.org.ua/search/?s=archive
17:45 πŸ”— JAA Ah, so this is what the robots.txt blocks, nico_32_. ^
17:46 πŸ”— Debiloid https://archive.org.ua/search/?s=arhiv also this for archives
17:46 πŸ”— Debiloid It may be useful for collecting Ukrain archive-sites
17:46 πŸ”— Debiloid Jaa?
17:47 πŸ”— JAA Yes, and the domain list in general will be quite interesting to get a sample of the Ukranian WWW.
17:48 πŸ”— Debiloid Some of them are russian i think
17:48 πŸ”— JAA Sure, I was referring to the .ua domains.
17:49 πŸ”— Debiloid Not all .ua domains are Ukrain and not all Ukrain domains are .ua
17:49 πŸ”— JAA Yes, that's why I said "sample".
17:50 πŸ”— Debiloid Hmmmm
17:52 πŸ”— Debiloid https://archive.org.ua/search/?s=Bibl jaa related to Bible and libraries
17:54 πŸ”— JAA Debiloid: We will archive the entire website anyway.
17:57 πŸ”— Debiloid has quit IRC (https://mibbit.com Online IRC Client)
18:02 πŸ”— JAA Unsurprisingly, the MSDN and TechNet forums are huge. Both have 2-3 million threads.
18:04 πŸ”— JAA And that's only the en-US ones.
18:11 πŸ”— DLoader has quit IRC (Quit: DLoader)
18:28 πŸ”— DLoader has joined #archiveteam-bs
18:56 πŸ”— nico_32_ JAA: and they already purged a lot of old content
18:58 πŸ”— JAA nico_32_: You mean the relaunch in 2008?
18:59 πŸ”— nico_32_ when they removed win9x & 2k content
18:59 πŸ”— nico_32_ and anything older
19:02 πŸ”— JAA Ah
19:09 πŸ”— JAA Looks like there's overlap between the two. Some threads appear in both.
19:09 πŸ”— JAA I also found https://social.microsoft.com/Forums/, which uses the same platform.
19:11 πŸ”— JAA And there was also something at http://social.expression.microsoft.com/forums/ apparently, but that doesn't seem to exist anymore.
20:08 πŸ”— PovAddict has joined #archiveteam-bs
20:45 πŸ”— Ctrl has quit IRC (Ping timeout: 857 seconds)
21:30 πŸ”— HP_Archiv has joined #archiveteam-bs
21:31 πŸ”— Ctrl has joined #archiveteam-bs
21:32 πŸ”— HP_Archiv has quit IRC (Client Quit)
21:33 πŸ”— HP_Archiv has joined #archiveteam-bs
21:48 πŸ”— Mayeau has joined #archiveteam-bs
21:52 πŸ”— Mayonaise has quit IRC (Read error: Operation timed out)
21:58 πŸ”— HP_Archiv has quit IRC (Quit: Leaving)
22:02 πŸ”— DogsRNice has quit IRC (Ping timeout: 265 seconds)
22:22 πŸ”— superkuh has quit IRC (Read error: Connection reset by peer)
22:35 πŸ”— superkuh has joined #archiveteam-bs
22:46 πŸ”— Arcorann has joined #archiveteam-bs
22:47 πŸ”— Arcorann has quit IRC (Remote host closed the connection)
22:48 πŸ”— Arcorann has joined #archiveteam-bs
22:56 πŸ”— jshoard has quit IRC (Leaving)

irclogger-viewer