#archiveteam-bs 2017-09-15,Fri

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
godanehttps://www.flickr.com/photos/52611635@N06/
the guys flickr pictures
with pictures of tapes
SketchCow: btw please send me tapes to help digitizing of them
[00:04]
.... (idle for 18mn)
***TheLovina has joined #archiveteam-bs
drumstick has quit IRC (Ping timeout: 255 seconds)
[00:24]
RichardG has quit IRC (Ping timeout: 255 seconds) [00:35]
BlueMaxim has joined #archiveteam-bs
JensRex has quit IRC (Remote host closed the connection)
JensRex has joined #archiveteam-bs
[00:40]
............ (idle for 55mn)
Asparagir has quit IRC (Asparagir) [01:38]
drumstick has joined #archiveteam-bs [01:43]
.... (idle for 17mn)
Honno has joined #archiveteam-bs
refeed has joined #archiveteam-bs
_refeed_ has joined #archiveteam-bs
refeed has quit IRC (Client Quit)
_refeed_ is now known as refeed
[02:00]
refeed has quit IRC (Ping timeout: 260 seconds) [02:14]
Stilett0 has joined #archiveteam-bs [02:23]
.... (idle for 15mn)
Honno has quit IRC (Read error: Operation timed out) [02:38]
Asparagir has joined #archiveteam-bs
svchfoo3 sets mode: +o Asparagir
svchfoo1 sets mode: +o Asparagir
[02:46]
refeed has joined #archiveteam-bs [02:58]
..... (idle for 21mn)
_refeed_ has joined #archiveteam-bs
refeed has quit IRC (Read error: Connection reset by peer)
[03:19]
__refeed_ has joined #archiveteam-bs
_refeed_ has quit IRC (Read error: Connection reset by peer)
[03:30]
.... (idle for 17mn)
Stilett0 is now known as Stiletto [03:47]
__refeed_ has quit IRC (Read error: Connection reset by peer) [04:01]
.... (idle for 15mn)
__refeed_ has joined #archiveteam-bs [04:16]
pizzaiolo has quit IRC (Quit: pizzaiolo) [04:24]
balrog has quit IRC (Read error: Operation timed out)
REiN^ has quit IRC (Read error: Operation timed out)
Mayonaise has quit IRC (Read error: Operation timed out)
squires has quit IRC (Write error: Broken pipe)
ruunyan has quit IRC (Read error: Operation timed out)
C4K3 has quit IRC (Read error: Operation timed out)
Asparagir has quit IRC (Read error: Operation timed out)
spacegirl has quit IRC (Read error: Operation timed out)
Mayonaise has joined #archiveteam-bs
robogoat has quit IRC (Read error: Operation timed out)
Odd0002 has quit IRC (Read error: Operation timed out)
bwn has quit IRC (Read error: Operation timed out)
__refeed_ has quit IRC (Ping timeout: 260 seconds)
drumstick has quit IRC (Read error: Operation timed out)
rocode has quit IRC (Read error: Operation timed out)
Baljem has quit IRC (Read error: Operation timed out)
balrog has joined #archiveteam-bs
swebb sets mode: +o balrog
svchfoo3 sets mode: +o balrog
__refeed_ has joined #archiveteam-bs
robogoat has joined #archiveteam-bs
htw has quit IRC (Read error: Operation timed out)
spacegirl has joined #archiveteam-bs
Odd0002 has joined #archiveteam-bs
Dimtree has quit IRC (Read error: Operation timed out)
PotcFdk has quit IRC (Read error: Operation timed out)
godane has quit IRC (Read error: Operation timed out)
tfgbd_znc has quit IRC (Read error: Operation timed out)
drumstick has joined #archiveteam-bs
robink has quit IRC (Read error: Operation timed out)
robink has joined #archiveteam-bs
htw has joined #archiveteam-bs
__refeed_ has quit IRC (Ping timeout: 260 seconds)
bwn has joined #archiveteam-bs
godane has joined #archiveteam-bs
Sk1d has quit IRC (Ping timeout: 250 seconds)
REiN^ has joined #archiveteam-bs
rocode has joined #archiveteam-bs
ruunyan has joined #archiveteam-bs
Sk1d has joined #archiveteam-bs
Sk1d has quit IRC (Connection Closed)
Sk1d has joined #archiveteam-bs
C4K3 has joined #archiveteam-bs
squires has joined #archiveteam-bs
tfgbd_znc has joined #archiveteam-bs
[04:31]
PotcFdk has joined #archiveteam-bs [05:07]
Dimtree has joined #archiveteam-bs
Baljem has joined #archiveteam-bs
[05:15]
what_the_ has quit IRC (Ping timeout: 268 seconds) [05:22]
__refeed_ has joined #archiveteam-bs [05:29]
........ (idle for 39mn)
Aranje has quit IRC (Quit: Three sheets to the wind)
etudier has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…)
[06:08]
tfgbd_znc has quit IRC (Read error: Connection reset by peer)
tfgbd_znc has joined #archiveteam-bs
[06:18]
__refeed_ has quit IRC (Remote host closed the connection) [06:28]
.......... (idle for 49mn)
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
[07:17]
.............. (idle for 1h6mn)
tuluu has quit IRC (Quit: No Ping reply in 180 seconds.)
tuluu has joined #archiveteam-bs
Honno has joined #archiveteam-bs
[08:24]
.......... (idle for 49mn)
drumstick has quit IRC (Read error: Operation timed out)
drumstick has joined #archiveteam-bs
[09:20]
........... (idle for 52mn)
hook54321Fyi, the maintainer of the Ublock repository (not ublock origin) has been deleting comments, issues, etc, asking about where the funds are going if there isn't any development going on. https://github.com/chrisaljoudi/uBlock [10:13]
JAAShall we grab https://github.com/chrisaljoudi/uBlock as well (without the code)?
(Moved from #archivebot)
Looks like the issues are still around, e.g. https://github.com/chrisaljoudi/uBlock/issues/1706
Ah yeah, he deleted comments. Hmm
[10:19]
hook54321one sec
I'm holding down the end key on twitter
[10:23]
JAAYeah, this isn't urgent, looks like most of the stuff happened two months ago anyway (including that ticket I linked). [10:26]
......... (idle for 41mn)
***pizzaiolo has joined #archiveteam-bs [11:07]
RichardG has joined #archiveteam-bs [11:12]
refeed has joined #archiveteam-bs
sep332 has quit IRC (Ping timeout: 260 seconds)
drumstick has quit IRC (Ping timeout: 255 seconds)
[11:25]
hook54321finally
It went from 4 hours ago, to 22 hours ago, to september 5th.
[11:31]
***_refeed_ has joined #archiveteam-bs [11:37]
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
BlueMaxim has quit IRC (Quit: Leaving)
yuitimoth has quit IRC (Read error: error:1408F119:SSL routines:SSL3_GET_RECORD:decryption failed or bad record mac)
yuitimoth has joined #archiveteam-bs
yuitimoth has quit IRC (Remote host closed the connection)
yuitimoth has joined #archiveteam-bs
[11:48]
hook54321!w f3p6sgk7e3f2hxwn0hy1w3xcq [11:50]
***_refeed_ has quit IRC (Leaving) [11:55]
tobbez has joined #archiveteam-bs
godane has quit IRC (Ping timeout: 260 seconds)
[12:02]
................... (idle for 1h32mn)
sep332 has joined #archiveteam-bs
pizzaiolo has quit IRC (Ping timeout: 245 seconds)
pizzaiolo has joined #archiveteam-bs
[13:36]
JAAhook54321: You're not the only one getting temp-banned from bit.ly. I currently only get 403 replies from them on at least one machine. [13:55]
***JAA___ has joined #archiveteam-bs
JAA sets mode: +o JAA___
JAA has quit IRC (leaving)
JAA has joined #archiveteam-bs
swebb sets mode: +o JAA
[14:03]
JAA___ has quit IRC (Quit: Page closed) [14:13]
pikhq has quit IRC (Read error: Operation timed out) [14:23]
odemgSketchCow, We've got people saying things like https://www.reddit.com/r/DataHoarder/comments/704h1g/saw_this_on_another_hoarding_site_first_there/dn0r7p6/ ..what's the status of ia/at getting these tapes? It'd be a shame to let some donut pick them up thinking he could digitise 24k tapes by himself :/ there's no coordination going on to I bet this guy is getting 100s of emails from people wasting his time and
ours.
[14:31]
JAAodemg: http://archive.fart.website/bin/irclogger_log/archiveteam-bs?date=2017-09-14,Thu&sel=301#l297 [14:37]
DFJustinhttps://twitter.com/textfiles/status/908432524128456704 [14:37]
odemgThank fuck. [14:38]
***Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam-bs
[14:41]
........ (idle for 35mn)
VADemon has joined #archiveteam-bs [15:16]
..... (idle for 22mn)
Aoedejrwr: why are they rioting [15:38]
jrwrSo
The main headline is "Police Officer Murder Trail: NOT GUILTY"
in 2011 a black guy named Anthony Smith Killed after a care chase
car*
[15:38]
AoedeI see [15:39]
jrwrso its a black lives matter protest [15:40]
Aoedethanks for the info :-) [15:40]
........ (idle for 39mn)
astridfuckin
why can't cops just like ... not kill people
it doesn't seem that complicated. i don't kill people. never even once!
[16:19]
Froggingi killed a centipede yesterday [16:25]
zinoI choose to belive that the spider I washed down in the shower this morning is living a happy life with his alligator buddies. [16:26]
***_refeed_ has joined #archiveteam-bs
refeed has quit IRC (Ping timeout: 600 seconds)
[16:37]
.......... (idle for 45mn)
atrocity has quit IRC (Ping timeout: 260 seconds) [17:26]
..... (idle for 23mn)
hook54321I'm just gonna place this here, I don't have time to do anything with it right now. It's a list of x.vu URLs.
https://gist.github.com/anonymous/e6a10e4adad2db2c453366ecd07c7359
[17:49]
***sun_shine has joined #archiveteam-bs [17:57]
..... (idle for 22mn)
RichardG has quit IRC (Read error: Connection reset by peer)
Asparagir has joined #archiveteam-bs
svchfoo3 sets mode: +o Asparagir
svchfoo1 sets mode: +o Asparagir
dd0a13f37 has joined #archiveteam-bs
RichardG has joined #archiveteam-bs
dd0a13f37 has quit IRC (Ping timeout: 268 seconds)
BartoCH has joined #archiveteam-bs
dd0a13f37 has joined #archiveteam-bs
[18:19]
dd0a13f37What is archiveteam's position on archiving stuff obtained by questionable means? For example, would you accept scrapes done with hijacked accounts? AT presumably violates ToS frequently, but do you have any official "guidelines" on what to do/not do? [18:33]
MrRadarFor services that require accounts we generally sign up for one specifically for archiving if that's possible, or ask for people to donate accoutns if not [18:34]
dd0a13f37But for example paywalled content for #newsgrabber [18:35]
MrRadarFor most sites just browsing with cookies disabled is enough to bypass the paywall, so that's not an issue [18:36]
dd0a13f37Well, for some, but some others (svd.se for example) require you to be logged in to an account that in turn needs to be registered with valid and working payment info [18:36]
MrRadarThose kind of cases I'm not sure about ¯\_(ツ)_/¯ [18:37]
dd0a13f37Also, there are complete .pdf archives of some news papers which need premium/subscriber accounts to access, but you can download them as long as you have the URL.
So you could log in with tor, get all the URLs, then use much more crude methods to fetch the actual files
If you want to archive them, it seems like the least bad method. You could have a volunteer register a trial account with valid payment info, but I don't think it's a brilliant idea to give our your name and address to do what legally speaking probably is copyright infringement of some kind to the people whose copyright you're allegedly infringing
Also, the getting of accounts can be done programatically. You would only need a few to stay clear of ratelimits, and most of the news sites have insecure login forms (doesn't say "invalid login", says "invalid email"/"invalid password")
WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
[18:37]
yipdwdd0a13f37: usually, getting on people's bad sides depends a lot on who that person is and how they're perceived relative to the rest of society
as a rule, pushing for shady means is frowned upon here
it's not hard-and-fast and I'm afraid that if you're looking for something you could codify in a program you won't find it here
[18:51]
dd0a13f37Okay, thanks.
Could you tell me the secret word for wiki account creation?
[18:52]
yipdwI can't remember what it is offhand [18:53]
dd0a13f37Well yes, it definitely goes under "shady" by any definition to use hijacked accounts, so that's pretty clear.
Okay, thanks. Could you add links to http://libgen.io/libgen/repository_torrent/ http://libgen.io/dbdumps/libgen/ to the page "Library Genesis"?
It's 30tb (for the books, much more in papers), and they're not in very good health (the server went down just now for instance)
[18:53]
***pikhq has joined #archiveteam-bs [19:02]
.... (idle for 16mn)
lag has joined #archiveteam-bs [19:18]
_refeed_ has quit IRC (Read error: Operation timed out) [19:29]
godane has joined #archiveteam-bs [19:42]
dd0a13f37What are your thoughts on archiving bittorrent DHT? There are some projects that are scraping it already, like btdig, btdb, torrentproject(dead), itorrents(not scraping but has a huge amount of torrents) [19:52]
astridi've thought about it
i have ENTIRELY too many projects, but it's appealing
[19:53]
dd0a13f37It should be as simple as asking them for a copy, the problem is what to do with the ones that discard torrent files after scraping them
Hey astrid, hate to spam, but do you have the wiki secret word? It's not y********s anymore
[19:54]
astridyeah i do, sec [19:57]
...... (idle for 25mn)
***dd0a13f37 has quit IRC (Ping timeout: 268 seconds) [20:22]
JAA"i have ENTIRELY too many projects, but it's appealing" -- That sounds way too familiar. [20:22]
zinoHey, you have all figured out my middle name! [20:23]
JAASpeaking of that: does anyone know of any active Reddit archival efforts? I had an idea yesterday...
Unfortunately, searching the logs for "reddit" is not particularly helpful.
(I'm aware of the comment dump up to 2015, hence why I wrote "active".)
[20:26]
***dd0a13f37 has joined #archiveteam-bs [20:32]
dd0a13f37Would it be too out of scope to run an archiveteam project to scrape the DHT? It will definitely be useful for the future, you can find tons of obscure stuff in other p2p networks if you have filenames etcetera, and it's quite "cheap" (4mb gets you one image or hundreds of torrents)
As in, not indexing services but getting it straight form the soruce
[20:33]
Asparagir!ig an2l7kygr2q9ilkuydo3qimq1 ^https?://sputniknews\.com/services/likes/
d'oh
[20:35]
dd0a13f37JAA: Isn't that still updated? [20:43]
JAAdd0a13f37: I haven't seen anything recently updated on IA, at least.
Oh right, here it is: http://files.pushshift.io/reddit/comments/
[20:44]
dd0a13f37oh
jackpot https://files.pushshift.io/
[20:47]
JAAAnd on IA: https://archive.org/details/reddit-data-comments [20:47]
dd0a13f37Why hasn't anyone come up with a decent solution for Tor on IRC? [20:47]
JAASweet. I won't have to do anything then. :-) [20:47]
godaneso looks like my squashfs file is look as a wave file for some reason [20:48]
dd0a13f37Allowing channel operators to ignore the ban, ask you to solve some captchas and wait a few days, allowing people to login to previously registered accounts, etc
run ffplay on it and see what happens
[20:48]
JAA#1 and #3 exist, but only on decent IRC networks (i.e. not EFNet). [20:49]
dd0a13f37I know that's how freenode does it [20:50]
godaneEVEN WHEN I PUT IT AS A .squashfs
still think its a wave file
[20:50]
JAAYes, Freenode belongs to the decent IRC networks. [20:51]
godanemy problem is with this item: https://archive.org/details/slackwarearm-14.2-20170906-kiwix [20:51]
dd0a13f37#2 works fine on freenet which is 100% anonymous and extremely slowly moderated AND has a huge spam problem, it's also used by swedish forum flashback. EFnet is nice and decentralized though
godane: what's the issue? you can still mount ot
[20:51]
JAATrue, but that also causes a decent amount of issues (e.g. netsplits). [20:52]
godaneits deriving like its a wave file
i'm trying to stop deriving
[20:52]
dd0a13f37deriving? [20:52]
godaneit trys to make a wave file into mp3, flac file [20:53]
dd0a13f37What? [20:53]
godanebut its not wave [20:53]
dd0a13f37just mount it manually if you're onl inux
a.org also recognizes it as such, see https://ia601500.us.archive.org/21/items/slackwarearm-14.2-20170906-kiwix/slackwarearm-14.2-20170906-kiwix_files.xml
sudo mount -o loop whatever.squashfs /mntpath/
[20:53]
godanei know that
my problem is the IA thinking its a wave
also i'm trying to stop it from deriving
[20:54]
dd0a13f37You can download the file though [20:55]
godaneits my fiel [20:55]
dd0a13f37if all else fails use the torrent
oh ok i see
[20:55]
astridthe .sb suffix is throwing it off apparently
according to the derive log https://catalogd.archive.org/log/733622918
[20:56]
godaneastrid: but i put squashfs as file name [20:56]
astridyeah that's weird
can you queue a derive with delete of all former derive results?
[20:56]
dd0a13f37It's not doing anything with the magics? [20:57]
astridbc it looks like you changed something in the last few minutes
but there's no derive scheduled
[20:57]
dd0a13f37xxd file | head -n 1, does it start with RIFF? [20:58]
godanei change the end from img to squashfs since .img was still saying its a wave file [20:58]
astridah, so you changed the filename? yeah, re-queue a derive and tick the "delete all prior versions" box [20:59]
godanei delete the derive manually [20:59]
astridthat also works i guess [21:00]
dd0a13f37If I want to contact someone for archival efforts, should I ask someone here to do it so it's done "officially" or can I just email them and ask for a DB copy? [21:01]
astrid#2 [21:01]
***etudier has joined #archiveteam-bs [21:04]
dd0a13f37https://pastebin.com/GztDCtV3 Is there anything else I should add? [21:11]
***etudier has quit IRC (Ping timeout: 370 seconds) [21:11]
..... (idle for 22mn)
AsparagirMight want to explain a little bit about who you and why you want the info, so they don't think you work for the RIAA or MPAA or something. [21:33]
dd0a13f37Too late, already sent it. But I do mention Torrentproject shutting down
itorrents is run by a guy in pakistan who writes out his full name and address on whois and also openly runs limetorrents, so I dont think he is worried about MPAA
btdigg used to provide an API, so they should understand.
[21:37]
dashcloudMrRadar: rather late, but the Hauppauge products are always reliable- I've used a PVR950q, and before that I used the PVR250 (which was a hardware MPEG2 encoder) [21:41]
dd0a13f37wow, the dht is big and it has more search engines than I thought
2-3tb of incompressible data, 13 chinese search engines, torbt, digbt, the 4 ones I already mentioned
[21:43]
..... (idle for 23mn)
***drumstick has joined #archiveteam-bs [22:09]
BartoCH has quit IRC (Quit: WeeChat 1.9) [22:20]
dd0a13f37 has quit IRC () [22:32]
............ (idle for 57mn)
robink has quit IRC (Ping timeout: 260 seconds)
robink has joined #archiveteam-bs
[23:29]
etudier has joined #archiveteam-bs [23:46]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)