[00:07] *** Zerote has quit IRC (Read error: Operation timed out) [00:10] *** BlueMax has joined #archiveteam-bs [00:17] *** ColdIce has quit IRC (Remote host closed the connection) [00:17] *** ColdIce has joined #archiveteam-bs [01:04] *** Lord_Nigh has quit IRC (Ping timeout: 252 seconds) [01:09] *** Lord_Nigh has joined #archiveteam-bs [01:27] Is the webroasting project still active? [01:34] *** wyatt8740 has joined #archiveteam-bs [01:39] *** ATrescue3 has joined #archiveteam-bs [01:42] Sorry, my IRC client automatically reconnected. I will leave immediately. [01:43] (Because my IP address changes every day due to internet provider.) [01:45] Before I leave: JAA: PurpleSym: Kaz: Before judging too early and thinking “not him again”, please be open-minded and read the answers: https://gist.github.com/ATRescue/bd40310ddf6c34ba68356fbd0f37352f – Thank you. [01:45] *** ATrescue3 has quit IRC (Quit: Leaving) [01:45] I mean uh [01:45] Ballsy. [01:48] And I have never personally attacked any ArchiveTeam member. [01:48] less J.A.A. (Just Always Angry) is found like 5 or 6 lines us [01:54] I mean personally I have always seen ArchiveTeam as fair (Even if I wasnt happy with some of the decisions) But i respect them. (Was also an interesting read that) [01:54] looks like someone's crusing for a wildcard ban, impressive [01:55] I think I almost got one of those once. I was lucky [01:55] (and pulled my head in somewhat) [01:56] Have we had many wildcard bans in the past? [01:58] looks like 8 in the current list [01:58] What list? there is a list? [01:59] How many times has my name been written then crossed out :-P [01:59] just in the banlist for the channel [02:00] who knows historically, people eventually give up [02:04] I never did find out if the archiveteam grand council came to a proper decision on me [02:23] i have no reason to not believe ATrescue at "client automatically reconnected." since now is usually the time his 24h forced reconnect happens, but dropping that link and pinging some people before leaving again is IMHO a ban evasion. the gist containing a message that SketchCow wrote three minutes after kicking ATr also suggests that he either has another client connected around or someone forwarding him [02:23] messages from here, in the latter case: congratulations you fucktard, whoever you are, you are entitled to leave as well now. [02:25] Good pick up Fusl [02:26] Fusl which message? [02:26] "I just super-blocked him." [02:29] Are the logs for this channel public? [02:32] haha looks like someone's crusing for a wildcard ban, impressive [02:32] I still think I narrowly avoided one of them🤔 [02:35] *** wyatt8740 has quit IRC (Quit: Ceci n'est pas un IRC quit message.) [02:36] it might be various substances or there is damn nice irony if there would be a public log site and people complain about it... as archiveteam :p (not related to before, just lold irl haha) [02:36] lol [02:36] i mean yea, this is live https://archive.fart.website/bin/irclogger_logs/archiveteam-bs [02:37] but not exactly user friendly as the IRC log interfaces from like a decade ago even [02:37] oh yeah they are [02:37] ohai wm_ [02:37] didnt expect you around here [02:37] yea i'm here but no worry, was since... years i guess lol [02:37] i have.. efnet history for a shitload of warez [02:38] topic probably never came up with us haha [02:38] good point [02:39] can/should someone archive https://scenenotice.org/browse.php ? [02:39] SSL broken and no idea if it qualifies, but such content gets extremely rare sadly [02:39] that would be #archivebot, not sure though if we archive stuff of questionable legality [02:39] not the case [02:40] well, not for the NFOs [02:40] It depends on how questionable it is [02:40] some attachments maybe mhm [02:40] Xrel demonstrated NFOs are legal, its just text [02:40] also, seems that there was a grab in 2015 https://archive.fart.website/archivebot/viewer/job/bamhk [02:41] nice, need to get a copy before i forget... i doubt many are interested but it was a fascinating world at the time [02:42] there have also been from this year. check https://archive.fart.website/archivebot/viewer/domain/scenenotice.org [02:43] so yeah i'd say that content is pretty much safe [02:44] yup seems good, and the Welcome to the Scene mini series is since 2016 on archive.org, replacing the crappy Gulli mirrors ( https://archive.org/details/welcome_to_the_scene_season_1 for anyone interested) [03:11] *** killsushi has quit IRC (Quit: Leaving) [03:13] *** t3 has quit IRC (Quit: Connection closed for inactivity) [03:18] *** qw3rty114 has joined #archiveteam-bs [03:24] *** qw3rty113 has quit IRC (Read error: Operation timed out) [03:43] *** t3 has joined #archiveteam-bs [03:55] *** odemgi_ has joined #archiveteam-bs [03:57] *** odemgi has quit IRC (Ping timeout: 252 seconds) [04:14] do we have plans to archive ghostbin? (shut down notice here: https://ghostbin.com/) [04:16] sounds hard? pastebinsshould not have public content listings, usually [04:18] I'm guessing we would likely use a combination of discovering links through stuff like twitter, reddit, etc, and warrior-based discovery. [04:18] however it's also behind cloudflare :/ [05:04] it being behind cf is bad but not a showstopper if we wanted to do it anyways [05:04] it says everything expires in 48h [05:05] or am i missing something [05:10] astrid: Because of shutting down, new pastes in that website will expire in 48 hours - it didn't in the past: https://web.archive.org/web/20190417131739/https://ghostbin.com/ [05:10] ah yeah ok [05:11] *** brayden has quit IRC (Ping timeout: 615 seconds) [05:12] *** brayden has joined #archiveteam-bs [05:15] idk i remain dubious of the value of archiving every pastebin clone, especially because they are often used with the expectation that data there SHOULD only last a few hours [05:15] and that feels like betraying the intent of the user [05:15] he/she has a point please circle which is correct [05:15] ? [05:16] I meant you had a point but I was unsure of pronoun [05:16] she [05:16] still can't really parse [05:17] I personally use 'they' to those that I don't know people I interact with enough~ [05:18] in my mind archiveteam is about preserving that which was published and is in danger of callous deletion by platforms who have decided "fuck you i am out of money" [05:18] archiving every paste on every pastebin feels ... out of place [05:19] i will however not stop you, merely question in public [05:19] Anyway; in regards of Pastebin-likes, what are your thoughts on archiving the public part of stuff? I mean, Pastebin has an option to have 'em unlisted [05:22] I mean if we do make a project I propose a name like pasteandcopy as a reversal of copy and paste [05:28] *** wyatt8740 has joined #archiveteam-bs [05:33] *** godane1 has quit IRC (Ping timeout: 265 seconds) [05:37] maybe one should note - i run a pastebin also - thattext compresses obviously extremely well [05:38] so moving and storing is probably negligible, finding the URLs and avoiding issues like leaks with a random trial & error generator... what you get from forums and social media might be next to nothing or everything without anyone disclosing data [05:53] #copybin [06:03] #ghosted-bin ? [06:11] #binned? [06:12] *** Zerote has joined #archiveteam-bs [06:24] SketchCow: i can haz write access to the flickr collection plz? [06:33] I'm assuming I made some errors while making that page :/ [06:45] how about #ghostedbin ? (Unless we want a single channel for pastebin-like services) [07:11] *** odemgi_ has quit IRC (Remote host closed the connection) [07:11] *** odemgi_ has joined #archiveteam-bs [07:30] *** Zerote has quit IRC (Ping timeout: 600 seconds) [07:35] *** Zerote has joined #archiveteam-bs [07:35] *** _niklas_ has quit IRC (Read error: Connection reset by peer) [07:45] *** _niklas has joined #archiveteam-bs [08:00] *** wyatt8740 has quit IRC (Read error: Operation timed out) [08:19] *** deevious has quit IRC (Quit: deevious) [08:29] *** deevious has joined #archiveteam-bs [09:44] *** godane has joined #archiveteam-bs [09:45] SketchCow; i'm digitizing 3 tapes i got from savers earlier this week [09:45] there reader digest tapes released in 1988 and hard cover ones [09:46] grand canyon, yosemite, yellowstone is on what i got [10:07] *** _niklas has quit IRC (Read error: Operation timed out) [10:08] *** MrRadar2 has quit IRC (Read error: Operation timed out) [10:08] *** _niklas has joined #archiveteam-bs [10:09] *** Frogging has quit IRC (Ping timeout: 268 seconds) [10:09] *** BnAboyZ has quit IRC (Ping timeout: 268 seconds) [10:10] *** bsmith093 has quit IRC (Read error: Operation timed out) [10:10] *** Dallas has quit IRC (Ping timeout: 268 seconds) [10:11] *** Fusl has quit IRC (Read error: Operation timed out) [10:11] *** Mateon1 has quit IRC (Ping timeout: 268 seconds) [10:11] *** Fusl has joined #archiveteam-bs [10:11] *** svchfoo1 sets mode: +o Fusl [10:11] *** svchfoo3 sets mode: +o Fusl [10:11] *** Jonimus has quit IRC (Ping timeout: 268 seconds) [10:11] *** Mateon1 has joined #archiveteam-bs [10:13] *** Frogging has joined #archiveteam-bs [10:13] *** kisspunch has quit IRC (Ping timeout: 268 seconds) [10:13] *** noirscape has quit IRC (Read error: Operation timed out) [10:13] *** bsmith093 has joined #archiveteam-bs [10:14] *** mundus201 has quit IRC (Ping timeout: 268 seconds) [10:14] *** Xibalba has quit IRC (Ping timeout: 268 seconds) [10:16] *** kisspunch has joined #archiveteam-bs [10:17] *** Tenebrae has quit IRC (Ping timeout: 268 seconds) [10:17] *** VoynichCr has quit IRC (Ping timeout: 268 seconds) [10:17] *** Fusl_ has quit IRC (Ping timeout: 365 seconds) [10:19] *** Fusl_ has joined #archiveteam-bs [10:19] *** Fusl_ has quit IRC (Client Quit) [10:20] *** Tenebrae has joined #archiveteam-bs [10:20] *** godane has quit IRC (Ping timeout: 268 seconds) [10:20] *** mundus201 has joined #archiveteam-bs [10:21] *** godane has joined #archiveteam-bs [10:21] *** Fusl_ has joined #archiveteam-bs [10:21] *** godane has quit IRC (Client Quit) [10:23] *** Xibalba has joined #archiveteam-bs [10:23] SketchCow: you're coming to the Netherlands? Whereabouts/what for? [10:24] *** Jonimus has joined #archiveteam-bs [10:24] *** svchfoo1 sets mode: +o Jonimus [10:24] *** svchfoo3 sets mode: +o Jonimus [10:26] *** kisspunch has quit IRC (Ping timeout: 268 seconds) [10:26] *** Tenebrae has quit IRC (Ping timeout: 268 seconds) [10:27] *** Ganonmast has quit IRC (Ping timeout: 268 seconds) [10:30] *** Xibalba has quit IRC (Ping timeout: 268 seconds) [10:30] *** BnAboyZ has joined #archiveteam-bs [10:30] *** Dallas has joined #archiveteam-bs [10:30] *** noirscape has joined #archiveteam-bs [10:30] *** godane has joined #archiveteam-bs [10:30] *** VoynichCr has joined #archiveteam-bs [10:30] *** Tenebrae has joined #archiveteam-bs [10:30] *** MrRadar2 has joined #archiveteam-bs [10:31] *** svchfoo1 sets mode: +o MrRadar2 [10:32] *** kisspunch has joined #archiveteam-bs [10:34] *** Xibalba has joined #archiveteam-bs [10:35] *** Ganonmast has joined #archiveteam-bs [10:36] *** Despatche has quit IRC (Quit: Read error: Connection reset by deer) [10:42] *** BlueMax has quit IRC (Read error: Connection reset by peer) [10:54] *** MrRadar2 has quit IRC (Read error: Operation timed out) [11:03] *** Ryz has quit IRC (Remote host closed the connection) [13:23] *** VerifiedJ has joined #archiveteam-bs [13:41] *** fredgido has quit IRC (Read error: Connection reset by peer) [13:41] *** fredgido has joined #archiveteam-bs [13:43] i finally found something japanese in google drive : https://drive.google.com/file/d/0B_Bf3673FOzNdDM3QnFHSS02YUE/edit [13:48] *** deevious has quit IRC (Remote host closed the connection) [13:48] *** zhongfu has quit IRC (Quit: No Ping reply in 180 seconds.) [13:50] *** zhongfu has joined #archiveteam-bs [13:58] *** zhongfu has quit IRC (Remote host closed the connection) [14:00] *** zhongfu has joined #archiveteam-bs [14:07] *** zhongfu_ has joined #archiveteam-bs [14:07] *** zhongfu has quit IRC (Read error: Connection reset by peer) [14:11] *** Zerote has quit IRC (Read error: Operation timed out) [14:24] *** Zerote has joined #archiveteam-bs [14:26] *** MrRadar2 has joined #archiveteam-bs [14:26] *** svchfoo3 sets mode: +o MrRadar2 [14:26] *** svchfoo1 sets mode: +o MrRadar2 [14:29] *** Damme has quit IRC (Read error: Connection reset by peer) [14:30] *** zhongfu_ has quit IRC (Quit: cya losers) [14:31] *** zhongfu has joined #archiveteam-bs [16:17] *** vitzli has joined #archiveteam-bs [16:18] *** vitzli has quit IRC (Client Quit) [17:12] *** wp494 has quit IRC (Ping timeout: 615 seconds) [17:13] *** wp494 has joined #archiveteam-bs [17:57] imagine this cunt on your doorbell: https://www.youtube.com/watch?v=lmEhCDRvQKA [18:53] there is no u in archiveteam [18:54] archiveteaum [18:54] checkmate americans [18:55] urchinteam [18:56] https://en.wikipedia.org/wiki/Sea_urchin [18:56] No thank you [18:57] they're basically the cutest [19:08] muricateam [19:58] *** VerifiedJ has quit IRC (Quit: Leaving) [20:43] *** Zerote has quit IRC (Ping timeout: 600 seconds) [20:45] i'm uploading a magazine called Cmeha or Smeha [20:46] i'm calling it Cmeha cause thats what looks like on pages [20:46] pdfs are coming from here http://smena-online.ru/archive/1924 [20:46] urls are going to look like this : cmeha-1924-issue-01 [21:12] *** Zerote has joined #archiveteam-bs [21:17] *** schbirid2 has quit IRC (Remote host closed the connection) [21:50] *** Despatche has joined #archiveteam-bs [22:05] This might be old news, but I just saw that the official IA help site mentions us: https://help.archive.org/hc/en-us/articles/360001513491-Save-Pages-in-the-Wayback-Machine :-) [22:25] Soo, Ghostbin. It seems they use [0-9a-z] as the charset and length 5 for public, 8 for password-protected pastes. The latter is way too long, but the former would be 60 million URLs, which is a lot but feasible if they don't have a problem with a few hundred requests per second. [22:26] http://xor.meo.ws/03e6b551/7bd2/4fe0/bee4/f9a7c1f84ac7.png [22:26] want me to do an export? [22:27] It seems there's still some discussion about whether we want to archive it at all. [22:27] Also, what does that screenshot tell me exactly? (How many URLs?) [22:30] JAA: http://xor.meo.ws/DLURRMOXR11ypOW7JQdGKV412JnqsXqV/screencapture-majestic-reports-site-explorer-2019-05-18-00_29_19.png [22:31] 1.21M backlinks to ghostbin.com [22:31] for discovery reasons [22:32] That's the number of (indexed) pages where a Ghostbin URL appears, right? [22:32] aye [22:32] And 46.5k URLs? [22:32] Ghostbin URLs* [22:33] afaik those are amount of urls that were crawled ON ghostbin.com itself [22:33] Ah [22:34] i dont know enough about how the majestic seo engine itself works to be 100% sure about that [22:34] So which of those numbers is how many ghostbin.com URLs there are in the index (crawled or not)? [22:34] but 1.21M historic backlinks is everything that we need to know about [22:35] although backlinks doesnt necessarily mean that this is as many urls we can discover through that since there may be duplicates, different domains pointing to the same url, etc. [22:35] Yeah, right. [22:36] indexed urls might be that number though [22:36] *** JH88 has quit IRC (Read error: Operation timed out) [22:37] 72,254 indexed urls in the historical database [23:16] *** BlueMax has joined #archiveteam-bs