[00:03] *** kristian_ has quit IRC (Quit: Leaving)
[00:25] Fuuuuuuuuuuuck that guyyyyyyyyyyyyy
[00:26] * xmc cries hahahahaha
[00:48] what's the website?
[00:48] i can't find the name in the post
[00:49] apparently it's ex.ua
[00:50] it's been splooged all over archiveteam channels for the past week, you've seen it already
[00:53] *** blahsvill has joined #archiveteam-bs
[00:55] *** blahsvill has quit IRC (Client Quit)
[01:19] *** zino has quit IRC (Remote host closed the connection)
[01:27] ok
[01:27] *** russkick has joined #archiveteam-bs
[01:47] https://medium.com/@vine/vine-update-59426a5adfab#.vq9jta2m7 Hum. By no means perfect, but that's at least rather more responsible than I'm used to seeing with shutdowns.
[01:50] So, the 'microsoft' scam people called... So I went to webrecorder.io and followed their instructions through the virtual web browser on there.
[01:51] Which is hosted through amazon web services...
[01:51] They were able to get into the virtual system
[01:51] Does anyone know whether or not they reuse their virtual systems?
[01:54] Also, is there a risk they could get ahold of my machine?
[02:00] Hi, Russ from The Memory Hole here, wanting to archive .gov and .mil sites that aren't completely in the wayback. Sometimes this is because of a robots.txt but other times it's apparently because of technical reasons. The National Archives (NARA, www.archives.gov) really needs a full back-up. There's an overwhelming number of PDF docs in there, and most of them aren't in the wayback.
[02:00] My tech knowledge is pretty limited, but it looks like NARA usually uses javascript(?) to fetch documents and lists of documents, rather than just using straightforward links.
[02:00] First, I think it'd be great to get a full mirror of archives.gov, but that's such a huge job that it probably requires extra preparation, so I won't attempt it myself.
[02:01] I would like to archive some smaller sections of it, though, and I wanted to see if there's a special recipe I should use.
[02:01] For example, NARA has scanned the CIA's Intelligence Publication Files, 1946 - 1950. This is the bibliographic page for them: https://catalog.archives.gov/id/6882518
[02:01] The wayback can't seem to capture this. It shows a pure white screen.
[02:01] Now, on that page, if you click the link "Includes: 150 file unit(s) described in the catalog," you get a dynamic page that lists all the documents. Again, the wayback doesn't capture this.
[02:02] Looking at that list, if you click on a document's title, you go to the page for that document. There you can flip through the pages, or you can download the whole thing as a PDF. For example, this is the page for "ORE 1, Soviet Foreign and Military Policy, 07/23/46": https://catalog.archives.gov/id/6924242
[02:02] That page isn't in the wayback either. Neither is "ORE 2": https://catalog.archives.gov/id/6924243
[02:02] My question is whether #archivebot will capture those pages/documents. And if so, is there something special I need to do?
[02:05] russkick: not sure if you saw the news in #archiveteam, but robots.txt is now ignored for .gov & .mil
[02:06] Nice! I didn't see that.
[02:07] But the NARA issue is a technical one. Wayback doesn't appear to capture pages with documents or lists of documents.
[02:11] russkick: You mean like "index of" pages?
[02:13] Pages like this: https://catalog.archives.gov/id/6924242
[02:13] It contains the document as a PDF and as a flip-through
[02:14] *** krazedkat has quit IRC (Quit: Leaving)
[02:14] oh. idk
[02:26] pdf url of the same item: https://catalog.archives.gov/58ea6b91-adff-4d83-aec0-fd8122a9f17a
[02:27] i can find that number in the raw html
[02:27] *can't
[02:27] I'm still anxious about the scammers getting into my computer, but I kinda know it's pretty unlikely, can someone confirm that?
[02:29] *** Somebody1 has quit IRC (Ping timeout: 370 seconds)
[02:34] you used a virtual machine right?
[02:38] yeah I don't really understand what you're saying
[02:39] what virtual system? usually these scams involve getting you to install a teamviewer-like thing that lets them control your PC
[02:41] unless things are really, really bad on the Internet now, you're safe unless you have epically bad luck
[02:42] Frogging: dashcloud : you have heard of webrecorder.io, right?
[02:42] we don't know what exactly you have executed, so we can't tell you for sure
[02:42] no, I haven't
[02:43] yes- that's the site that saves a web session (usually for saving a copy of a web page) right?
[02:43] Yeah
[02:44] And they have an option to choose a web browser other than what you are using, which opens up an in-browser virtual machine like thing.
[02:44] let's go through what would have to go wrong in order for you to be screwed here (I'm guessing you don't make a habit of messing with scammers right?)
[02:44] *** ravetcofx has quit IRC (Read error: Operation timed out)
[02:44] What do you mean messing with them?
[02:45] This was the remote desktop software they were having me use: https://win.screenconnect.com/
[02:45] do you regularly screw with scammers such that your handle is well-known to them, such that they would even think of coming after you?
[02:45] http://www.419eater.com/
[02:46] One time I kept them on the line for an hour and then simply told them that I was going to hang up on them.
[02:47] They were VERY angry
[02:47] otherwise, they'd need to break out of the VM (a non-trivial task, and very much wasted in this case) before they could have a shot at your computer or they would need to acquire your real IP somehow (difficult, because you're using Amazon's IP block, and if they get your IP from Amazon, we've all got bigger problems)
[02:48] assuming you didn't do something silly like use a real name or something that is trivially tied back to your real identity, I think you're fine
[02:48] Use my real name where?
[02:48] with the scammers
[02:49] I mean, I never told them it, I don't know if they have acquired a phone number list with names somehow though.
[02:50] They kept on calling me "mam"
[02:50] They always do that xD
[02:52] they don't have anything on you, but you shouldn't do this again for your own sake
[02:52] Yeah...
[02:52] Waste of time, makes me anxious, and other reasons
[02:53] Sometime I should say something like "Did you just assume my gender?" after they call me mam. xD
[02:54] I don't think they'll understand
[02:55] You can have fun with them as long as you state that after your current issue/conflict is solved that you'll keep on going with them
[02:55] to keep them motivated like a pig and the carrot
[02:55] That's what I did when I kept them on the line for an hour
[02:56] Why do they even take jobs like this?
[02:57] - when you're good at nothing and need money and have no principles
[02:59] Why haven't they been shut down?
[03:01] *** ravetcofx has joined #archiveteam-bs
[03:03] Did someone die?
[03:03] huh?
[03:29] dashcloud: I think even if they got his IP it wouldn't do much for them
[03:29] *** compu has quit IRC (Read error: Operation timed out)
[03:30] unless there's a router admin panel sitting on port 80 with a default password or something :p
[03:31] Errr...
Even if it did have the default password, the port would need to be open
[03:31] yes that was implied
[03:32] was poking fun at shitty routers
[03:34] Eh, anyway my router no longer uses the default password
[03:44] *** VADemon has quit IRC (Quit: left4dead)
[03:47] *** dashcloud has quit IRC (Read error: Operation timed out)
[03:50] *** dashcloud has joined #archiveteam-bs
[03:54] *** ndiddy has quit IRC (Read error: Connection reset by peer)
[04:19] *** jrwr has quit IRC (Remote host closed the connection)
[04:53] So... The scammers called again...
[04:54] I got tons of the codes that they use for the remote desktop applications. Now I'm gonna use them to report them to those websites.
[04:55] I also asked him what he thinks about Donald Trump
[04:59] you could also not answer
[05:01] Yeah, but this way I can get the websites to suspend their accounts, and they'll have to pay for a new one, or just be banned completely from it.
[05:03] And if they have to pay more money for accounts, then the less money they'll have to hire people to make calls
[05:03] if it's AWS they're using, good luck
[05:03] Couldn't the software companies ask amazon to ban them?
[05:04] why would any of them care
[05:06] I know for a fact that at least some of the software companies do, the paid ones care because if they ban them then they'll have to pay for a new subscription, the free ones might care if they are using a significant amount of their resources. Amazon could maybe get them to have to pay again.
[05:06] I guess we all have our windmills
[05:08] the sites i got their codes for are screenconnect.com, fastsupport.com, and anydesk
[05:13] Although, I'm kinda wondering why they call me so much
[05:15] you answer
[05:22] yep
[05:29] *** Sk1d has quit IRC (Ping timeout: 194 seconds)
[05:35] *** Sk1d has joined #archiveteam-bs
[05:35] I assumed they would have some sort of system to make notes on people they call... I guess not?
[06:55] *** Somebody1 has joined #archiveteam-bs
[06:58] *** russkick has quit IRC ()
[08:12] *** GE has joined #archiveteam-bs
[09:02] *** Somebody1 has quit IRC (Read error: Operation timed out)
[09:04] *** Honno has joined #archiveteam-bs
[10:00] *** zino has joined #archiveteam-bs
[10:22] *** DiscantX has joined #archiveteam-bs
[10:40] *** GE has quit IRC (Quit: zzz)
[10:43] *** ravetcofx has quit IRC (Read error: Operation timed out)
[10:44] *** Silvan has joined #archiveteam-bs
[10:44] *** SilSte has quit IRC (Read error: Connection reset by peer)
[11:05] *** BlueMaxim has quit IRC (Quit: Leaving)
[11:09] *** kristian_ has joined #archiveteam-bs
[11:31] What to do with old computers, very many old computers?
[11:46] That depends on how old and what types. If you got any with IMP/4 CPUs I'll take them off your hands. Running out of spares...
[11:48] In general old computers are a problem. Not many museums take them. Finding a user group for the type of computer you have is a good idea.
[11:48] Hmmm
[11:49] Yeah
[11:50] zino: what is IMP/4?
[11:52] *** dashcloud has quit IRC (Read error: Operation timed out)
[11:53] A 4-bit microprocessor made by National Semiconductor in the 70s. They were what was called bit-slicers where you could use 2 to build an 8-bit computer and 4 to build a 16-bit one.
[11:55] *** dashcloud has joined #archiveteam-bs
[12:02] *** dashcloud has quit IRC (Read error: Operation timed out)
[12:05] *** dashcloud has joined #archiveteam-bs
[12:07] The only reasonably well known computer using them was the IMP-16 built by NS themselves. I'm repairing a lesser known system made in Sweden.
[12:13] *** GE has joined #archiveteam-bs
[12:15] *** schbirid has joined #archiveteam-bs
[13:15] *** GE has quit IRC (Ping timeout: 255 seconds)
[13:15] *** GE has joined #archiveteam-bs
[13:35] my shit ISP gives me like 100 kilobyte/s from github :(
[13:48] *** VADemon has joined #archiveteam-bs
[13:59] *** GE has quit IRC (Remote host closed the connection)
[14:12] *** kristian_ has quit IRC (Quit: Leaving)
[14:20] *** vitzli has joined #archiveteam-bs
[14:49] *** DiscantX has quit IRC (Read error: Operation timed out)
[15:30] *** GE has joined #archiveteam-bs
[16:31] *** ndiddy has joined #archiveteam-bs
[16:41] *** vitzli has quit IRC (Quit: Leaving)
[17:11] *** VADemon has quit IRC (Quit: left4dead)
[17:12] *** VADemon has joined #archiveteam-bs
[17:34] *** Somebody1 has joined #archiveteam-bs
[17:40] *** dashcloud has quit IRC (Read error: Operation timed out)
[17:44] *** Somebody1 has quit IRC (Ping timeout: 370 seconds)
[17:45] zino: I don't think that old
[17:45] interesting information though
[17:46] the vague description i've received is ~10 years old
[18:48] *** ravetcofx has joined #archiveteam-bs
[18:51] *** Somebody1 has joined #archiveteam-bs
[18:55] SketchCow, can we have a target setup for ftp-gov please?
[20:12] anyone have an interest in 50 TB of scihub/scimag contents, and/or have a copy already? and/or who should i pester that will probably alread have this?
[20:12] *already
[20:22] kanzure_: only scihub content or non-scihub mags?
[20:23] IA supposedly already has a copy of scihub
[20:23] yea but not one i can use
[20:23] "non-scihub mags" not sure what this means
[20:50] *** schbirid has quit IRC (Quit: Leaving)
[21:06] *** schbirid has joined #archiveteam-bs
[21:35] *** DiscantX has joined #archiveteam-bs
[21:41] Does IA have any burgers in its archives?
http://www.cbc.ca/radio/asithappens/as-it-happens-thursday-edition-1.3896872/how-a-47-year-old-burger-became-a-fixture-in-the-alberta-legislature-1.3896885
[21:52] *** BlueMaxim has joined #archiveteam-bs
[22:27] *** Honno has quit IRC (Read error: Operation timed out)
[22:35] *** jrwr has joined #archiveteam-bs
[22:45] *** Administr has joined #archiveteam-bs
[22:47] *** schbirid has quit IRC (Quit: Leaving)
[22:50] *** HCross has quit IRC (Ping timeout: 370 seconds)
[22:53] *** HCross3 has joined #archiveteam-bs
[22:59] *** Administr has quit IRC (Ping timeout: 370 seconds)
[23:00] *** Administr has joined #archiveteam-bs
[23:03] *** Administr has quit IRC (Client Quit)
[23:04] *** HCross3 has quit IRC (Ping timeout: 370 seconds)
[23:05] *** HCross has joined #archiveteam-bs
[23:05] *** HCross has quit IRC (Client Quit)
[23:08] *** HCross has joined #archiveteam-bs
[23:13] *** Administr has joined #archiveteam-bs
[23:19] *** HCross has quit IRC (Ping timeout: 370 seconds)
[23:20] *** Administr has quit IRC (Ping timeout: 370 seconds)
[23:22] *** HarryCros has joined #archiveteam-bs
[23:33] *** GE has quit IRC (Remote host closed the connection)