[00:20] *** mgrytbak^ has quit IRC (Ping timeout: 492 seconds) [00:32] *** VerifiedJ has quit IRC (Quit: Leaving) [00:45] *** mgrytbak^ has joined #archiveteam-ot [01:01] *** Odd0002_ has joined #archiveteam-ot [01:02] *** Odd0002 has quit IRC (Ping timeout: 252 seconds) [01:02] *** Odd0002_ is now known as Odd0002 [01:21] *** anarcat has quit IRC (Quit: leaving) [01:52] *** Ryz has quit IRC (Quit: ChatZilla 0.9.92-rdmsoft [XULRunner 35.0.1/20150122214805]) [03:16] *** icedice has joined #archiveteam-ot [03:17] Is it safe to stop an external HDD that is being reformated? I don't think it will be done in time for when I have to catch the train. [03:19] yes [03:19] *** Stilett0 has joined #archiveteam-ot [03:19] Good, that's what I figured [03:19] *** Stiletto has quit IRC (Ping timeout: 265 seconds) [03:20] "formatting" a drive by writing to every sector is kind of pointless unless you're doing a write-then-read test to check for bitflips or something [03:20] you can write an empty filesystem to a drive in a few seconds [03:29] *** Stiletto has joined #archiveteam-ot [03:29] *** Stilett0 has quit IRC (Ping timeout: 268 seconds) [03:40] It's good if using the drive for the first time I heard [03:40] In case there's damage or something like that [04:07] *** m007a83 has quit IRC (Read error: Connection reset by peer) [04:30] *** m007a83 has joined #archiveteam-ot [04:32] *** ggus has quit IRC (Ping timeout: 265 seconds) [04:35] *** m007a83 has quit IRC (Read error: Connection reset by peer) [04:35] *** m007a83 has joined #archiveteam-ot [04:37] *** m007a83 has quit IRC (Remote host closed the connection) [04:38] *** m007a83 has joined #archiveteam-ot [04:39] *** hook54321 has quit IRC (Ping timeout: 252 seconds) [04:41] *** hook54321 has joined #archiveteam-ot [04:52] *** odemg has quit IRC (Ping timeout: 260 seconds) [04:54] *** ggus has joined #archiveteam-ot [05:02] *** ivan has quit IRC (Read error: Operation timed out) [05:02] *** vectr0n has quit IRC (Read error: Connection reset by peer) [05:03] *** vectr0n has joined #archiveteam-ot [05:03] *** ivan has joined #archiveteam-ot [05:03] *** svchfoo3 sets mode: +o ivan [05:04] *** odemg has joined #archiveteam-ot [05:11] *** Ryz has joined #archiveteam-ot [05:23] *** Martle has quit IRC (Ping timeout: 252 seconds) [06:01] *** tuluu has quit IRC (Ping timeout: 260 seconds) [06:26] unless you're actually checking for errors it makes no difference [07:39] *** VADemon has quit IRC (Read error: Connection reset by peer) [07:50] *** Odd0002 has quit IRC (ZNC - http://znc.in) [07:52] *** Odd0002 has joined #archiveteam-ot [08:21] "This hooker says you need a hooker." - Jason Scott (2009) [08:21] *** uzerus has joined #archiveteam-ot [08:39] *** BlueMax has quit IRC (Read error: Connection reset by peer) [09:09] *** BlueMax has joined #archiveteam-ot [09:27] *** icedice has quit IRC (Quit: Leaving) [10:52] *** Mateon1 has quit IRC (Ping timeout: 265 seconds) [10:53] *** Mateon1 has joined #archiveteam-ot [11:03] *** BlueMax has quit IRC (Quit: Leaving) [11:27] *** Ryz has quit IRC (Quit: ChatZilla 0.9.92-rdmsoft [XULRunner 35.0.1/20150122214805]) [12:27] From pm: [12:28] [2018-11-14 12:25:16] Horribly [12:28] [2018-11-14 12:25:41] This is the textbook: http://www4.comp.polyu.edu.hk/~comp2421/ComputerOrganizationAndDesign5thEdition2014.pdf [12:28] [2018-11-14 12:25:46] If you wanna read though it [12:28] [2018-11-14 12:25:53] Programming on paper = cancer [12:28] [2018-11-14 12:25:58] Oh well i hope it does not crush some dreams [12:28] [2018-11-14 12:26:19] oh i am hearing something similar [12:28] [2018-11-14 12:26:34] I like my IDE [12:28] [2018-11-14 12:27:04] I like having autocomplete, I like syntax highlighting, I like syntax checking [12:28] [2018-11-14 12:27:20] Those things don't exist on paper, also -ot [12:28] Oh... twoTBHetz isn't here [12:28] *** twoTBHetz has joined #archiveteam-ot [12:29] kiska i hear "computer architecture and computer design" but it is only like 3 hours aweek so likely worth less than your exam [12:29] *** uzerus has quit IRC (Read error: Operation timed out) [12:29] My exam was worth 50% of my mark xD [12:30] oh funny thing i will also have an pre-exam in a few days where i can get bonus points for the exam [12:31] (of the "same" class [12:32] Now I go back to marking the HSC [12:32] HSC? [12:33] Higher School Certificate [12:33] NSW's completion of high school thing [12:36] Anyway its 11pm, and I have to be up at 8am to mark ~80k exams in 3 weeks [12:38] ahh ... i find the thought that students tests are marked by anything but the teacher horrible [12:38] Don't worry I am cross checked with 4 other markers [12:39] this is not my problem ... [12:39] Eg A question worth 5 marks, and I give 3.4, the mark I give has to be ±3% of the other markers [12:40] So in that example it would be 3.298 and 3.502 out 5 marks [12:41] i suspect that you won't give me feedback on my doc then. When could i check back for that? (I tried to use it as an oppertunity to learn more about documenation). [12:41] So what is this meant to say "How did better come on to being?" I am assuming you rushed though the process? [12:42] Give me a few minutes so I can try and improve upon your doc [12:44] kiska no it is my first time helping out [12:44] feed back is appretiated [12:47] I would suggest as an examiner looking at your document is to spend more time writing it. A lot of the issues I see are grammar and broken sentences. Like that example I posted earlier, what is that meant to say? [12:48] oh ok ... the documentation was rushed ... [12:48] ivan: I took a quick look at terastash. It does indeed seem like a lot of custom software. [12:49] Kinda surprised by the use of Javascript also. [12:49] I am talking about the file "better" which was published as the final list [12:49] It reads very rushed, and the goal of documentation is to make sure other people understand what you have done. And whilst I know you have used sublist3r, what if I didn't? [12:50] Then this sentence "sublist/sense/* is sublist/raw/* expect that i filtered out dead links via header reqests." makes no sense to me whatsoever [12:50] ivan: I noticed the readme talks about rclone. I'm not super familiar with rclone, but is there a clear advantage to terastash over rclone? [12:54] *** twoTBHetz has quit IRC (Read error: Operation timed out) [12:55] *** twoTBHetz has joined #archiveteam-ot [13:02] kiska back [13:02] ok [13:03] i read what i missed [13:04] twoTBHetz So with the limited time I have with your doc, I have sort of summarised it. Here it is: https://pad.riseup.net/p/oDrnpz31VOOa-keep [13:05] Your more than welcome to edit the file. [13:05] i wanted to say that the files in raw/* are the same as the files in * except that i threw a header request against each domain and removed the once which did not respond [13:05] ok [13:06] Also order is important as well. I believe you went in reverse order? [13:06] Yep i did [13:07] With documentation, you want to order what you did, in chronological order from first thing you did to last [13:08] Its no good to the person reading it, if you make up some order that the reader doesn't know [13:10] twoTBHetz here are some documentation principles: https://www.writethedocs.org/guide/writing/docs-principles/ [13:15] twoTBHetz: Anyway if you continue to write, I can possibly correct some grammar mistakes when I wake up in the morning [13:16] thanks for taking the time [14:23] *** twoTBHetz has quit IRC (Read error: Connection reset by peer) [15:44] *** martini has joined #archiveteam-ot [16:57] jodizzle: maybe just local metadata. rclone didn't exist when I wrote this, and I still haven't looked at it [16:57] I have to maintain this until Google kicks me out just because I have so much data in it already, heh [17:05] *** VerifiedJ has joined #archiveteam-ot [17:14] *** wp494 has quit IRC (Read error: Operation timed out) [17:15] *** wp494 has joined #archiveteam-ot [17:28] *** LFlare has joined #archiveteam-ot [17:31] *** LFlarey has joined #archiveteam-ot [17:32] *** LFlare has quit IRC (Ping timeout: 252 seconds) [17:40] *** LFlarey has quit IRC (Ping timeout: 506 seconds) [17:40] *** LFlarey has joined #archiveteam-ot [17:46] *** anarcat has joined #archiveteam-ot [17:47] *** LFlarey has quit IRC (Ping timeout: 265 seconds) [17:47] *** LFlarey has joined #archiveteam-ot [17:56] *** LFlarey has quit IRC (Ping timeout: 506 seconds) [18:00] *** LFlarey has joined #archiveteam-ot [18:27] *** schbirid has joined #archiveteam-ot [18:28] *** Martle has joined #archiveteam-ot [19:11] *** Ryz has joined #archiveteam-ot [19:58] does anyone have a list of influential twitter users, I'm preparing a second batch of users to archive [19:58] there are too many twitter accounts :( [19:58] I might as well add everyone that wikileaks is following [20:15] that's https://gist.github.com/ivan/6cee6834f5c3a6cb7402836dbb860617 if anyone wants it [20:49] *** tuluu has joined #archiveteam-ot [21:22] *** Martle_ has joined #archiveteam-ot [21:22] *** Martle has quit IRC (Read error: Connection reset by peer) [21:36] *** Martle_ has quit IRC (Quit: Leaving) [21:53] ivan: Add dril as well [21:55] *** maxiPsych has quit IRC (Ping timeout: 268 seconds) [21:58] *** BlueMax has joined #archiveteam-ot [22:09] I've got dril [22:19] *** schbirid has quit IRC (Remote host closed the connection) [22:28] *** Martle has joined #archiveteam-ot [22:39] ivan: here is a list of 130k twitter accounts of notable people (people with Wikipedia article) https://www.archiveteam.org/index.php?title=Wikidata_lists [22:42] lol not found, transfer.sh link expired [22:43] Should be somewhere in the ArchiveBot collection though. [22:43] The file, not a snapshot of the transfer.sh URL. [22:44] https://archive.org/download/archiveteam_archivebot_go_20180924220001/urls-transfer.sh-wikidata-twitter-133k.txt-shallow-20180922-142608-b6ug6-urls.txt [22:45] i have just found that the list isn't deduplicated [22:45] ivan: another at-risk, but nonessential youtube account: [22:45] https://www.youtube.com/slycoopermovie [22:46] it's a film, so it's probably backed up to the hills, but it's onlt two videos (two versions of the trailer) and the 3D version is less-spread than the regular version [22:46] VoynichCr: FYI, ArchiveBot/wpull does deduplicate. [22:48] JAA: good [22:51] betamax: added [22:54] *** Jens has quit IRC (Read error: Connection reset by peer) [22:54] *** Jens has joined #archiveteam-ot [23:41] *** martini has quit IRC (Read error: Operation timed out) [23:44] *** Martle has quit IRC (Ping timeout: 252 seconds) [23:46] *** Martle has joined #archiveteam-ot [23:53] *** BlueMax has quit IRC (Ping timeout: 260 seconds)