#archiveteam-bs 2019-07-21,Sun

↑back Search

Time Nickname Message
00:26 πŸ”— ephemer0l has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
00:31 πŸ”— ephemer0l has joined #archiveteam-bs
01:52 πŸ”— killsushi has joined #archiveteam-bs
02:32 πŸ”— Somebody2 continuing from #archiveteam, Raccoon
02:32 πŸ”— Raccoon ok
02:33 πŸ”— Somebody2 there's no index of stuff that was grabbed by archiveteam and not put up on archive.org -- that's correct.
02:33 πŸ”— Raccoon that's a very specific request I didn't make though :)
02:33 πŸ”— Somebody2 The archiveteam wiki might provide some vague hints
02:33 πŸ”— Raccoon I see what you did there
02:34 πŸ”— Somebody2 and as for an index of stuff grabbed by archiveteam and put up on archive.org -- for that, you can just use the archive.org search
02:34 πŸ”— Raccoon so what about the stuff that archiveteam grabbed, while forgetting that archive.org even exists
02:34 πŸ”— Somebody2 that would only be documented on the archiveteam wiki
02:35 πŸ”— Somebody2 generally, most efforts by archiveteam are documented there
02:35 πŸ”— Raccoon ok, so there is a complete file index somewhere on the wiki
02:35 πŸ”— Somebody2 but not particularly completely or consistently
02:35 πŸ”— Somebody2 a file index? heck no. It's an index of *projects* -- i.e. "the time when we grabbed this recipie website"
02:36 πŸ”— Somebody2 or "the time we got yelled at by Apple for trying to grab something they were trying to remove"
02:36 πŸ”— Somebody2 (these are hypothetical examples)
02:36 πŸ”— Raccoon most projects are comprised of files with file names, right?
02:37 πŸ”— Somebody2 most projects are lists of URLs that are grabbed and stored into WARC files.
02:37 πŸ”— Somebody2 and then posted to archive.org and included in the Wayback Machine
02:37 πŸ”— Somebody2 that's the usual flow
02:37 πŸ”— Raccoon hmm
02:38 πŸ”— Raccoon that's the kicker. you can't really search for a *\filename.here query on archive.or
02:38 πŸ”— Raccoon .org
02:38 πŸ”— Somebody2 yes you can
02:38 πŸ”— Raccoon one needs to know the domain name for which they want to find said file
02:38 πŸ”— Somebody2 ah, not in the Wayback Machine, I see
02:38 πŸ”— Somebody2 yeah, the Wayback Machine search is still somewhat limited
02:39 πŸ”— Somebody2 although I think it allows more than full URL searches (but I think it is still limited to domains)
02:39 πŸ”— Raccoon and you say that archiveteam doesn't keep a master index of urls that can be grepped for \\filename\.here
02:45 πŸ”— Raccoon Hopefully if the day comes where we need to find all ten parts of the disarm code to the doom's day machine (cleverly hosted on 10 different free hosting websites between 1998 and 2005, under the filename disarmcode.html), the search engine will be prepared for this :P
03:03 πŸ”— Somebody2 hopefully!
03:03 πŸ”— Somebody2 I mean, if someone wanted to throw enough money at one of the cloud services, making such a filename index at least of the stuff ...
03:04 πŸ”— Somebody2 ... grabbed by archiveteam would absolutely be possible.
03:04 πŸ”— Somebody2 And I don't see any reason it wouldn't be welcomed.
03:04 πŸ”— * Flashfire throws monopoly money
03:05 πŸ”— Maylay has quit IRC (Pipe Terminated)
03:05 πŸ”— * Somebody2 catches the monopoly money, stares at it (it says: "Valid only on AWS; expires in 1995"), and throws it back
03:07 πŸ”— Maylay has joined #archiveteam-bs
03:07 πŸ”— Maylay has quit IRC (Remote host closed the connection!)
03:08 πŸ”— Maylay has joined #archiveteam-bs
03:10 πŸ”— kiska I assume we know about tinypic shutting down?
03:11 πŸ”— kiska https://server8.kiska.pw/uploads/397c396a6c83b1bc/unknown.png
03:12 πŸ”— Raccoon :(
03:13 πŸ”— wyatt8740 has quit IRC (Remote host closed the connection)
03:16 πŸ”— Raccoon Somebody2: that's one of the things that's carried me through the years in completing collections. Getting lucky with a Google intitle:"index of" search of known filenames already in the set, finding somebody else hosting them, and hitting the jackpot
03:16 πŸ”— Raccoon since other people have picked up on that, this has really gone away for the most part. (i discovered the trick before it was hipster)
03:17 πŸ”— Raccoon would also clue me in on the merits of contributing some of my junk
03:17 πŸ”— Somebody2 Raccoon: hm?
03:18 πŸ”— Raccoon hmm?
03:18 πŸ”— arkiver ah shit
03:18 πŸ”— arkiver kiska: letΒ΄s think of a channel
03:19 πŸ”— dxrt oh shit
03:20 πŸ”— Raccoon how many dick pics do you suppose they've hosted over the years?
03:20 πŸ”— kiska I think #ohshit is applicable
03:20 πŸ”— dxrt #tinydick
03:20 πŸ”— Raccoon jinxed dxrt
03:20 πŸ”— arkiver nice
03:20 πŸ”— dxrt haha
03:21 πŸ”— arkiver kiska: agree with #tinydick?
03:21 πŸ”— kiska Yep
03:21 πŸ”— arkiver awesome
03:22 πŸ”— Raccoon can't wait for mega.nz to shut down
03:23 πŸ”— * Raccoon squats on #megadick for the puns
03:23 πŸ”— kiska Please no mega.nz there is so much js on there
03:24 πŸ”— Raccoon don't they have an api
03:24 πŸ”— Raccoon mobile apps
03:33 πŸ”— ivan_ https://github.com/meganz/MEGAcmd
03:58 πŸ”— qw3rty119 has joined #archiveteam-bs
04:04 πŸ”— qw3rty118 has quit IRC (Read error: Operation timed out)
04:15 πŸ”— Pixi` has quit IRC (Read error: Connection reset by peer)
04:15 πŸ”— Pixi has joined #archiveteam-bs
04:17 πŸ”— d5f4a3622 has quit IRC (Read error: Connection reset by peer)
04:19 πŸ”— d5f4a3622 has joined #archiveteam-bs
04:46 πŸ”— Pokemonpr has joined #archiveteam-bs
04:47 πŸ”— Pokemonpr Hey, question. Is there a way to check what the bot has archived?
04:55 πŸ”— ivan_ Which bot
04:55 πŸ”— Pokemonpr ...uhm, the main one that auto-archives smaller sites? Not sure if it has a proper name
04:56 πŸ”— ivan_ See #archivebot topic
04:56 πŸ”— Pokemonpr There's a site I suggested for it to archive a little while ago; but I'm not sure if it got done. The site went from "Sort of dead but not in real danger" to "We're shutting down August 1st" and I wanted to check
04:56 πŸ”— Pokemonpr Thank you
04:56 πŸ”— Pokemonpr Haven't been here in a little while, sorry if this was a dumb quesiton
04:57 πŸ”— ivan_ http://archive.fart.website/archivebot/viewer/
04:59 πŸ”— godane1 has joined #archiveteam-bs
05:00 πŸ”— godane has quit IRC (Read error: Operation timed out)
05:06 πŸ”— Pokemonpr Can't see it there.. Could I request something then?
05:58 πŸ”— dxrt Pokemonpr: what site?
06:03 πŸ”— m007a83 has quit IRC (Read error: Operation timed out)
06:07 πŸ”— Pokemonpr dxrt it's amesfanclub.com ; it recently announced out of the blue that it was going down August 1st due to reasons out of their control.
06:08 πŸ”— Dimtree has joined #archiveteam-bs
06:11 πŸ”— dxrt Pokemonpr: I've added it.
06:34 πŸ”— Pokemonpr thank you
07:58 πŸ”— m007a83 has joined #archiveteam-bs
09:45 πŸ”— VerifiedJ has joined #archiveteam-bs
10:23 πŸ”— JAA Somebody2: How does one search by filename on IA? I'm not aware of any method to do so. It only searches item metadata (identifier, title, description, etc.), not the names of files inside items.
11:00 πŸ”— BlueMax has quit IRC (Quit: Leaving)
12:07 πŸ”— BIER has joined #archiveteam-bs
12:07 πŸ”— BIER has quit IRC (Client Quit)
12:18 πŸ”— killsushi has quit IRC (Quit: Leaving)
13:04 πŸ”— HashbangI has quit IRC (Remote host closed the connection)
13:12 πŸ”— HashbangI has joined #archiveteam-bs
14:34 πŸ”— Somebody2 um, let me look.
14:35 πŸ”— Somebody2 if nothing else, you can download the (now rather out of date) IA census and use that.
14:35 πŸ”— Somebody2 that may be the only way, indeed.
15:10 πŸ”— Hani has quit IRC (Quit: Hani)
15:17 πŸ”— Verified_ has quit IRC (Ping timeout: 252 seconds)
15:47 πŸ”— fallenoak All you really need to search by files in the projects that produced megawarcs is the cdx files (which are fortunately fairly small)
15:47 πŸ”— fallenoak At least, that's what I did when I wanted to comb through the files in the GameFront grab
15:49 πŸ”— wyatt8740 has joined #archiveteam-bs
16:05 πŸ”— Igloo betamax: CD’s sorted
16:06 πŸ”— schbirid has joined #archiveteam-bs
16:16 πŸ”— betamax Yay!
16:29 πŸ”— Somebody2 fallenoak: yes, but you need to know which project you care about first :-)
16:38 πŸ”— Hani has joined #archiveteam-bs
17:00 πŸ”— Igloo if http_stat.statcode == 500 then
17:00 πŸ”— Igloo -- try again
17:00 πŸ”— Igloo woops
17:16 πŸ”— DogsRNice has joined #archiveteam-bs
18:15 πŸ”— Joseph_ has joined #archiveteam-bs
18:15 πŸ”— VerifiedJ has quit IRC (Read error: Connection reset by peer)
18:26 πŸ”— antomati_ is now known as antomatic
18:41 πŸ”— Dallas has joined #archiveteam-bs
18:50 πŸ”— hi has joined #archiveteam-bs
19:45 πŸ”— VerifiedJ has joined #archiveteam-bs
19:45 πŸ”— Joseph_ has quit IRC (Read error: Connection reset by peer)
19:46 πŸ”— VerifiedJ has quit IRC (Read error: Connection reset by peer)
19:47 πŸ”— VerifiedJ has joined #archiveteam-bs
19:50 πŸ”— Ravenloft has joined #archiveteam-bs
19:57 πŸ”— DogsRNice has quit IRC (Ping timeout: 252 seconds)
19:58 πŸ”— Dj-Wawa has joined #archiveteam-bs
19:59 πŸ”— DogsRNice has joined #archiveteam-bs
20:07 πŸ”— hi has quit IRC (Quit: Page closed)
20:19 πŸ”— DogsRNice has quit IRC (Ping timeout: 252 seconds)
20:28 πŸ”— Smiley has quit IRC (Ping timeout: 265 seconds)
20:29 πŸ”— Smiley has joined #archiveteam-bs
20:38 πŸ”— schbirid has quit IRC (Remote host closed the connection)
20:39 πŸ”— DogsRNice has joined #archiveteam-bs
20:59 πŸ”— Ravenloft has quit IRC (Remote host closed the connection)
21:42 πŸ”— VerifiedJ has quit IRC (Read error: Connection reset by peer)
21:42 πŸ”— VerifiedJ has joined #archiveteam-bs
22:28 πŸ”— Dj-Wawa has quit IRC (Quit: Connection closed for inactivity)
22:29 πŸ”— Dj-Wawa has joined #archiveteam-bs
22:30 πŸ”— fredgido_ has quit IRC (Read error: Operation timed out)
22:36 πŸ”— dashcloud has quit IRC (Remote host closed the connection)
22:37 πŸ”— dashcloud has joined #archiveteam-bs
23:22 πŸ”— BlueMax has joined #archiveteam-bs
23:26 πŸ”— fredgido has joined #archiveteam-bs
23:26 πŸ”— SmileyG has joined #archiveteam-bs
23:27 πŸ”— Smiley has quit IRC (Read error: Operation timed out)
23:32 πŸ”— benjinsmi has joined #archiveteam-bs
23:33 πŸ”— benjins has quit IRC (Ping timeout: 252 seconds)
23:42 πŸ”— benjins has joined #archiveteam-bs
23:43 πŸ”— benjinsmi has quit IRC (Ping timeout: 604 seconds)
23:47 πŸ”— VerifiedJ has quit IRC (Read error: Operation timed out)

irclogger-viewer