#archiveteam-bs 2016-12-17,Sat

Time Nickname Message
00:03 🔗 kristian_ has quit IRC (Quit: Leaving)
00:25 🔗 SketchCow Fuuuuuuuuuuuck that guyyyyyyyyyyyyy
00:26 🔗 * xmc cries hahahahaha
00:48 🔗 godane whats the website?
00:48 🔗 godane i can't find the name in the post
00:49 🔗 kniffy apparently its ex.ua
00:50 🔗 xmc it's been splooged all over archiveteam channels for the past week, you've seen it already
00:53 🔗 blahsvill has joined #archiveteam-bs
00:55 🔗 blahsvill has quit IRC (Client Quit)
01:19 🔗 zino has quit IRC (Remote host closed the connection)
01:27 🔗 godane ok
01:27 🔗 russkick has joined #archiveteam-bs
01:47 🔗 pikhq https://medium.com/@vine/vine-update-59426a5adfab#.vq9jta2m7 Hum. By no means perfect, but that's at least rather more responsible than I'm used to seeing with shutdowns.
01:50 🔗 hook54321 So, the 'microsoft' scam people called... So I went to webrecorder.io and followed their instructions through the virtual web browser on there.
01:51 🔗 hook54321 Which is hosted through amazon web services...
01:51 🔗 hook54321 They were able to get into the virtual system
01:51 🔗 hook54321 Does anyone know whether or not they reuse their virtual systems?
01:54 🔗 hook54321 Also, is there a risk they could get ahold of my machine?
02:00 🔗 russkick Hi, Russ from The Memory Hole here, wanting to archive .gov and .mil sites that aren't completely in the wayback. Sometimes this is because of a robots.txt but other times it's apparently because of technical reasons. The National Archives (NARA, www.archives.gov) really needs a full back-up. There's an overwhelming number of PDF docs in there, and most of them aren't in the wayback.
02:00 🔗 russkick My tech knowledge is pretty limited, but it looks like NARA usually uses javascript(?) to fetch documents and lists of documents, rather than just using straightforward links.
02:00 🔗 russkick First, I think it'd be great to get a full mirror of archives.gov, but that's such a huge job that it probably requires extra preparation, so I won't attempt it myself.
02:01 🔗 russkick I would like to archive some smaller sections of it, though, and I wanted to see if there's a special recipe I should use.
02:01 🔗 russkick For example, NARA has scanned the CIA's Intelligence Publication Files, 1946 - 1950. This is the bibliographic page for them: https://catalog.archives.gov/id/6882518
02:01 🔗 russkick The wayback can't seem to capture this. It shows a pure white screen.
02:01 🔗 russkick Now, on that page, if you click the link "Includes: 150 file unit(s) described in the catalog," you get a dynamic page that lists all the documents. Again, the wayback doesn't capture this.
02:02 🔗 russkick Looking at that list, if you click on a document's title, you go to the page for that document. There you can flip through the pages, or you can download the whole thing as a PDF. For example, this is the page for "ORE 1, Soviet Foreign and Military Policy, 07/23/46": https://catalog.archives.gov/id/6924242
02:02 🔗 russkick That page isn't in the wayback either. Neither is "ORE 2": https://catalog.archives.gov/id/6924243
02:02 🔗 russkick My question is whether #archivebot will capture those pages/documents. And if so, is there something special I need to do?
02:05 🔗 dashcloud russkick: not sure if you saw the news in #archiveteam, but robots.txt is now ignored for .gov & .mil
02:06 🔗 russkick Nice! I didn't see that.
02:07 🔗 russkick But the NARA issue is a technical one. Wayback doesn't appear to capture pages with documents or lists of documents.
02:11 🔗 hook54321 russkick: You mean like "index of" pages?
02:13 🔗 russkick Pages like this: https://catalog.archives.gov/id/6924242
02:13 🔗 russkick It contains the document as a PDF and as a flip-through
02:14 🔗 krazedkat has quit IRC (Quit: Leaving)
02:14 🔗 hook54321 oh. idk
02:26 🔗 godane pdf url of the same item: https://catalog.archives.gov/58ea6b91-adff-4d83-aec0-fd8122a9f17a
02:27 🔗 godane i can find that number in the raw html
02:27 🔗 godane *can't
02:27 🔗 hook54321 I'm still anxious about the scammers getting into my computer, but I kinda know it's pretty unlikely, can someone confirm that?
02:29 🔗 Somebody1 has quit IRC (Ping timeout: 370 seconds)
02:34 🔗 dashcloud you used a virtual machine right?
02:38 🔗 Frogging yeah I don't really understand what you're saying
02:39 🔗 Frogging what virtual system? usually these scams involve getting you to install a teamviewer-like thing that lets them control your PC
02:41 🔗 dashcloud unless things are really, really bad on the Internet now, you're safe unless you have epically bad luck
02:42 🔗 hook54321 Frogging: dashcloud : you have heard of webrecorder.io, right?
02:42 🔗 VADemon we don't know what exactly you have executed, so we can't tell you for sure
02:42 🔗 Frogging no, I haven't
02:43 🔗 dashcloud yes- that's the site that saves a web session (usually for saving a copy of a web page) right?
02:43 🔗 hook54321 Yeah
02:44 🔗 hook54321 And they have an option to choose a web browser other than what you are using, which opens up an in-browser virtual machine like thing.
02:44 🔗 dashcloud let's go through what would have to go wrong in order for you to be screwed here (I'm guessing you don't make a habit of messing with scammers right?)
02:44 🔗 ravetcofx has quit IRC (Read error: Operation timed out)
02:44 🔗 hook54321 What do you mean messing with them?
02:45 🔗 hook54321 This was the remote desktop software they were having me use: https://win.screenconnect.com/
02:45 🔗 dashcloud do you regularly screw with scammers such that your handle is well-known to them, such that they would even think of coming after you?
02:45 🔗 Frogging http://www.419eater.com/
02:46 🔗 hook54321 One time I kept them on the line for an hour and then simply told them that I was going to hang up on them.
02:47 🔗 hook54321 They were VERY angry
02:47 🔗 dashcloud otherwise, they'd need to break out of the VM (a non-trivial task, and very much wasted in this case) before they could have a shot at your computer or they would need to acquire your real IP somehow (difficult, because you're using Amazon's IP block, and if they get your IP from Amazon, we've all got bigger problems)
02:48 🔗 dashcloud assuming you didn't do something silly like use a real name or something that is trivially tied back to your real identity, I think you're fine
02:48 🔗 hook54321 Use my real name where?
02:48 🔗 dashcloud with the scammers
02:49 🔗 hook54321 I mean, I never told them it, I don't know if they have acquired a phone number list with names somehow though.
02:50 🔗 hook54321 They kept on calling me "mam"
02:50 🔗 hook54321 They always do that xD
02:52 🔗 dashcloud they don't have anything on you, but you shouldn't do this again for your own sake
02:52 🔗 hook54321 Yeah...
02:52 🔗 hook54321 Waste of time, makes me anxious, and other reasons
02:53 🔗 hook54321 Sometime I should say something like "Did you just assume my gender?" after they call me mam. xD
02:54 🔗 hook54321 I don't think they'll understand
02:55 🔗 VADemon You can have fun with them as long as you state that after your current issue/conflict is solved that you'll keep on going with them
02:55 🔗 VADemon to keep them motivated like a pig and the carrot
02:55 🔗 hook54321 That's what I did when I kept them on the line for an hour
02:56 🔗 hook54321 Why do they even take jobs like this?
02:57 🔗 VADemon - when you're good at nothing and need money and have no principles
02:59 🔗 hook54321 Why haven't they been shut down?
03:01 🔗 ravetcofx has joined #archiveteam-bs
03:03 🔗 SketchCow Did someone die?
03:03 🔗 hook54321 huh?
03:29 🔗 Frogging dashcloud: I think even if they got his IP it wouldn't do much for them
03:29 🔗 compu has quit IRC (Read error: Operation timed out)
03:30 🔗 Frogging unless there's a router admin panel sitting on port 80 with a default password or something :p
03:31 🔗 hook54321 Errr... Even if it did have the default password, the port would need to be open
03:31 🔗 Frogging yes that was implied
03:32 🔗 Frogging was poking fun at shitty routers
03:34 🔗 hook54321 Eh, anyway my router no longer uses the default password
03:44 🔗 VADemon has quit IRC (Quit: left4dead)
03:47 🔗 dashcloud has quit IRC (Read error: Operation timed out)
03:50 🔗 dashcloud has joined #archiveteam-bs
03:54 🔗 ndiddy has quit IRC (Read error: Connection reset by peer)
04:19 🔗 jrwr has quit IRC (Remote host closed the connection)
04:53 🔗 hook54321 So... The scammers called again...
04:54 🔗 hook54321 I got tons of the codes that they use for the remote desktop applications. Now I'm gonna use them to report them to those websites.
04:55 🔗 hook54321 I also asked him what he thinks about Donald Trump
04:59 🔗 yipdw you could also not answer
05:01 🔗 hook54321 Yeah, but this way I can get the websites to suspend their accounts, and they'll have to pay for a new one, or just be banned completely from it.
05:03 🔗 hook54321 And if they have to pay more money for accounts, then the less money they'll have to hire people to make calls
05:03 🔗 yipdw if it's AWS they're using, good luck
05:03 🔗 hook54321 Couldn't the software companies ask amazon to ban them?
05:04 🔗 yipdw why would any of them care
05:06 🔗 hook54321 I know for a fact that at least some of the software companies do, the paid ones care because if they ban them then they'll have to pay for a new subscription, the free ones might care if they are using a significant amount of their resources. Amazon could maybe get them to have to pay again.
05:06 🔗 yipdw I guess we all have our windmills
05:08 🔗 hook54321 the sites i got their codes for are screenconnect.com, fastsupport.com, and anydesk
05:13 🔗 hook54321 Although, I'm kinda wondering why they call me so much
05:15 🔗 yipdw you answer
05:22 🔗 Frogging yep
05:29 🔗 Sk1d has quit IRC (Ping timeout: 194 seconds)
05:35 🔗 Sk1d has joined #archiveteam-bs
05:35 🔗 hook54321 I assumed they would have some sort of system to make notes on people they call... I guess not?
06:55 🔗 Somebody1 has joined #archiveteam-bs
06:58 🔗 russkick has quit IRC ()
08:12 🔗 GE has joined #archiveteam-bs
09:02 🔗 Somebody1 has quit IRC (Read error: Operation timed out)
09:04 🔗 Honno has joined #archiveteam-bs
10:00 🔗 zino has joined #archiveteam-bs
10:22 🔗 DiscantX has joined #archiveteam-bs
10:40 🔗 GE has quit IRC (Quit: zzz)
10:43 🔗 ravetcofx has quit IRC (Read error: Operation timed out)
10:44 🔗 Silvan has joined #archiveteam-bs
10:44 🔗 SilSte has quit IRC (Read error: Connection reset by peer)
11:05 🔗 BlueMaxim has quit IRC (Quit: Leaving)
11:09 🔗 kristian_ has joined #archiveteam-bs
11:31 🔗 fie What to do with old computers? Very many old computers?
11:46 🔗 zino That depends on how old and what types. If you got any with IMP/4 CPUs I'll take them off your hands. Running out of spares...
11:48 🔗 zino In general old computers are a problem. Not many museums take them. Finding a user group for the type of computer you have is a good idea.
11:48 🔗 fie Hmmm
11:49 🔗 fie Yeah
11:50 🔗 fie zino: what is IMP/4?
11:52 🔗 dashcloud has quit IRC (Read error: Operation timed out)
11:53 🔗 zino A 4-bit microprocessor made by National Semiconductor in the 70s. They were what was called bit-slicers where you could use 2 to build an 8-bit computer and 4 to build a 16-bit one.
11:55 🔗 dashcloud has joined #archiveteam-bs
12:02 🔗 dashcloud has quit IRC (Read error: Operation timed out)
12:05 🔗 dashcloud has joined #archiveteam-bs
12:07 🔗 zino The only reasonably well known computer using them was the IMP-16 built by NS themselves. I'm repairing a lesser known system made in Sweden.
12:13 🔗 GE has joined #archiveteam-bs
12:15 🔗 schbirid has joined #archiveteam-bs
13:15 🔗 GE has quit IRC (Ping timeout: 255 seconds)
13:15 🔗 GE has joined #archiveteam-bs
13:35 🔗 schbirid my shit ISP gives me like 100kilobyte/s from github:(
13:48 🔗 VADemon has joined #archiveteam-bs
13:59 🔗 GE has quit IRC (Remote host closed the connection)
14:12 🔗 kristian_ has quit IRC (Quit: Leaving)
14:20 🔗 vitzli has joined #archiveteam-bs
14:49 🔗 DiscantX has quit IRC (Read error: Operation timed out)
15:30 🔗 GE has joined #archiveteam-bs
16:31 🔗 ndiddy has joined #archiveteam-bs
16:41 🔗 vitzli has quit IRC (Quit: Leaving)
17:11 🔗 VADemon has quit IRC (Quit: left4dead)
17:12 🔗 VADemon has joined #archiveteam-bs
17:34 🔗 Somebody1 has joined #archiveteam-bs
17:40 🔗 dashcloud has quit IRC (Read error: Operation timed out)
17:44 🔗 Somebody1 has quit IRC (Ping timeout: 370 seconds)
17:45 🔗 fie zino: I don't think that old
17:45 🔗 fie interesting information though
17:46 🔗 fie the vague description i've received is ~10 years old
18:48 🔗 ravetcofx has joined #archiveteam-bs
18:51 🔗 Somebody1 has joined #archiveteam-bs
18:55 🔗 HCross SketchCow, can we have a target setup for ftp-gov please?
20:12 🔗 kanzure_ anyone have an interest in 50 TB of scihub/scimag contents, and/or have a copy already? and/or who should i pester that will probably alread have this?
20:12 🔗 kanzure_ *already
20:22 🔗 Medowar kanzure_: only scihub content or non-scihub mags?
20:23 🔗 Medowar IA supposedly already has a copy of scihub
20:23 🔗 kanzure_ yea but not one i can use
20:23 🔗 kanzure_ "non-scihub mags" not sure what this means
20:50 🔗 schbirid has quit IRC (Quit: Leaving)
21:06 🔗 schbirid has joined #archiveteam-bs
21:35 🔗 DiscantX has joined #archiveteam-bs
21:41 🔗 DiscantX Does IA have any burgers in its archives? http://www.cbc.ca/radio/asithappens/as-it-happens-thursday-edition-1.3896872/how-a-47-year-old-burger-became-a-fixture-in-the-alberta-legislature-1.3896885
21:52 🔗 BlueMaxim has joined #archiveteam-bs
22:27 🔗 Honno has quit IRC (Read error: Operation timed out)
22:35 🔗 jrwr has joined #archiveteam-bs
22:45 🔗 Administr has joined #archiveteam-bs
22:47 🔗 schbirid has quit IRC (Quit: Leaving)
22:50 🔗 HCross has quit IRC (Ping timeout: 370 seconds)
22:53 🔗 HCross3 has joined #archiveteam-bs
22:59 🔗 Administr has quit IRC (Ping timeout: 370 seconds)
23:00 🔗 Administr has joined #archiveteam-bs
23:03 🔗 Administr has quit IRC (Client Quit)
23:04 🔗 HCross3 has quit IRC (Ping timeout: 370 seconds)
23:05 🔗 HCross has joined #archiveteam-bs
23:05 🔗 HCross has quit IRC (Client Quit)
23:08 🔗 HCross has joined #archiveteam-bs
23:13 🔗 Administr has joined #archiveteam-bs
23:19 🔗 HCross has quit IRC (Ping timeout: 370 seconds)
23:20 🔗 Administr has quit IRC (Ping timeout: 370 seconds)
23:22 🔗 HarryCros has joined #archiveteam-bs
23:33 🔗 GE has quit IRC (Remote host closed the connection)
