Time |
Nickname |
Message |
00:01
π
|
|
closure has joined #archiveteam-bs |
00:35
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
00:54
π
|
|
closure has joined #archiveteam-bs |
01:00
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
01:01
π
|
|
closure has joined #archiveteam-bs |
01:08
π
|
|
Stilett0 has quit IRC (Read error: Connection reset by peer) |
01:08
π
|
|
Stilett0 has joined #archiveteam-bs |
01:25
π
|
godane |
SketchCow: 623 pdfs ED files are saved doing the audit for ERIC archive |
01:29
π
|
godane |
hear are all the pdfs added so far by doing this audit : https://archive.org/details/@chris85?and[]=addeddate%3A2018-09+eric&and[]=subject%3A%22ERIC+Archive%22 |
01:35
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
01:35
π
|
|
closure has joined #archiveteam-bs |
02:00
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
02:01
π
|
|
closure has joined #archiveteam-bs |
02:09
π
|
|
yoyo11 has joined #archiveteam-bs |
02:10
π
|
|
yoyo11 has quit IRC (Quit: Page closed) |
02:11
π
|
moufu |
is there a reason warrior projects use wget-lua 1.14.* instead of 1.17.1? I've noticed 1.14.* doesn't extract urls from html5 media tags |
02:21
π
|
ivan |
probably because no one updated it |
02:22
π
|
ivan |
wpull might be a better option though |
02:34
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
02:35
π
|
|
closure has joined #archiveteam-bs |
02:59
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
03:05
π
|
|
closure has joined #archiveteam-bs |
03:30
π
|
Flashfire |
Can someone scrape https://mobile.twitter.com/BJTMarts |
03:35
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
03:35
π
|
ivan |
Flashfire: https://gist.githubusercontent.com/ivan/2e6a0431bb41745060ea8b5ddf218852/raw/e1a9cf3e0f9a4132ed467d92d8c12f9d16955684/gistfile1.txt |
03:36
π
|
ivan |
Flashfire: you can run snscrape on your computer, no? |
03:39
π
|
|
archodg__ has joined #archiveteam-bs |
03:39
π
|
Flashfire |
Yeah but donβt have access to it right now |
03:41
π
|
|
archodg_ has quit IRC (Ping timeout: 252 seconds) |
03:41
π
|
|
odemg has quit IRC (Ping timeout: 260 seconds) |
03:54
π
|
|
odemg has joined #archiveteam-bs |
04:27
π
|
|
closure has joined #archiveteam-bs |
04:32
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
04:47
π
|
|
closure has joined #archiveteam-bs |
04:59
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
05:00
π
|
|
closure_ has joined #archiveteam-bs |
05:37
π
|
|
closure has joined #archiveteam-bs |
05:37
π
|
|
closure_ has quit IRC (Read error: Connection reset by peer) |
06:05
π
|
|
m007a83 has quit IRC (Quit: Fuck you Comcast) |
06:06
π
|
|
closure has quit IRC (Read error: Operation timed out) |
06:08
π
|
|
Mateon1 has quit IRC (Ping timeout: 268 seconds) |
06:09
π
|
|
Mateon1 has joined #archiveteam-bs |
06:10
π
|
|
Petri152 has quit IRC (Read error: Operation timed out) |
06:10
π
|
|
JAA has quit IRC (Read error: Operation timed out) |
06:10
π
|
|
zyphlar has quit IRC (Read error: Operation timed out) |
06:11
π
|
|
jspiros has quit IRC (Read error: Operation timed out) |
06:13
π
|
|
bithippo has quit IRC (Ping timeout: 246 seconds) |
06:15
π
|
|
c4rc4s has quit IRC (Read error: Operation timed out) |
06:17
π
|
|
m007a83 has joined #archiveteam-bs |
06:23
π
|
|
tuluu has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
Coderjo has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
Dimtree has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
jut_ has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
moufu has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
TC01 has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
nyaomi has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
Sanqui has quit IRC (irc.Prison.NET west.us.hub) |
06:23
π
|
|
joepie91_ has quit IRC (irc.Prison.NET west.us.hub) |
06:24
π
|
|
logchfoo1 starts logging #archiveteam-bs at Sat Sep 29 06:24:52 2018 |
06:24
π
|
|
logchfoo1 has joined #archiveteam-bs |
06:25
π
|
|
zino_ has joined #archiveteam-bs |
06:25
π
|
|
phirephl- has joined #archiveteam-bs |
06:25
π
|
|
Polylith_ has joined #archiveteam-bs |
06:33
π
|
|
Atom-- has joined #archiveteam-bs |
06:33
π
|
|
achip has joined #archiveteam-bs |
06:36
π
|
|
Atom__ has quit IRC (Ping timeout: 252 seconds) |
06:54
π
|
|
achip has quit IRC (west.us.hub irc.Prison.NET) |
07:11
π
|
|
c4rc4s has joined #archiveteam-bs |
07:11
π
|
|
zyphlar has joined #archiveteam-bs |
07:11
π
|
|
Petri152 has joined #archiveteam-bs |
07:12
π
|
|
JAA has joined #archiveteam-bs |
07:12
π
|
|
swebb sets mode: +o JAA |
07:12
π
|
|
bakJAA sets mode: +o JAA |
07:15
π
|
|
jspiros has joined #archiveteam-bs |
07:22
π
|
|
achip has joined #archiveteam-bs |
08:33
π
|
|
C4K3_ has joined #archiveteam-bs |
08:35
π
|
|
C4K3 has quit IRC (Read error: Operation timed out) |
09:00
π
|
HCross |
JAA: good news. I got Brozzler working |
09:18
π
|
godane |
SketchCow: i'm up to 2018-06-27 with kpfa collection |
09:19
π
|
godane |
trying to get that up to date |
09:20
π
|
HCross |
or not.. xrdp just decided it didnt want to do anything |
09:39
π
|
|
Guestiiii has joined #archiveteam-bs |
09:40
π
|
Guestiiii |
hello |
09:40
π
|
Flashfire |
hi |
09:41
π
|
Guestiiii |
I'm trying to get "backups" of pages I bookmark. Is wpull the/a right method for that? |
09:41
π
|
Guestiiii |
I've seen it has little activity recently, and I'm having trouble actually making it work, so I'm not sure I'm doing it right. |
09:42
π
|
Guestiiii |
(And sorry if this is the wrong place to ask. I've come here since wpull's docs link to this place) |
09:42
π
|
Flashfire |
try grabsite made by our own ivan |
09:42
π
|
Flashfire |
IVAN WE HAVE SOMEONE TO SEE YOU |
09:43
π
|
Guestiiii |
I've seen that too, it's essentially a nice wrapper around wpull? |
09:43
π
|
ivan |
yes |
09:44
π
|
Guestiiii |
I've also seen something acting as a proxy, but losing the original ssl certs is not so good for me |
09:44
π
|
ivan |
you can for example dump your bookmarks to a list of URLs and use grab-site --1 -i FILE |
09:45
π
|
Guestiiii |
oh, that's neat |
09:45
π
|
ivan |
here is a bookmarklet for saving pages to wayback that you may or may not be able to access in the future javascript:(function(){window.open('https://web.archive.org/save/'+(''+window.location));})(); |
09:46
π
|
ivan |
I usually combine that with compulsive ctrl-s'ing to .mhtml files in Chrome |
09:46
π
|
Guestiiii |
can you configure firefox so that it calls grabsite whenever you bookmark stuff? |
09:46
π
|
Guestiiii |
and what about wpull not seeming to be maintained anymore? is that any trouble for you? |
09:47
π
|
ivan |
you can dump your history from Firefox https://www.gwern.net/Archiving-URLs#batch-job-downloads |
09:47
π
|
ivan |
wpull 1.2.3 works pretty well for me other than slowness |
09:47
π
|
Guestiiii |
ok, good |
09:47
π
|
ivan |
also it not being a headless browser but that's a whole other deal |
09:48
π
|
Guestiiii |
which means pages don't really look the same? |
09:48
π
|
ivan |
it doesn't run javascript and scroll pages and click on ajax things |
09:48
π
|
Guestiiii |
anyway, thanks for grab-site, I'll give it a run! |
09:49
π
|
Guestiiii |
yeah |
09:49
π
|
Guestiiii |
but I saw this "phantom-js" thing |
09:49
π
|
ivan |
if you want to save pages post-javascript-execution both Firefox and Chrome have a ctrl-s that can do that |
09:49
π
|
ivan |
chromebot in #archivebot can do it too |
09:49
π
|
ivan |
also webrecorder.io |
09:50
π
|
Guestiiii |
mmh, so pages that are blank pre-js don't yield good archives with wpull? |
09:50
π
|
ivan |
usually not |
09:50
π
|
ivan |
sometimes they are blank but have content blanked with css |
09:50
π
|
Guestiiii |
ah, shit |
09:50
π
|
Guestiiii |
and what does the wayback machine work? |
09:50
π
|
Guestiiii |
*how |
09:51
π
|
ivan |
wayback is totally insane, it copies javascript to their domain and tries to execute it there |
09:51
π
|
Guestiiii |
mmh, it's all very hacky it seems :) |
09:52
π
|
ivan |
I mean your browser lands on web.archive.org and tries to execute archived javascript files (sometimes coming a newer snapshot or from liveweb) |
09:53
π
|
ivan |
sometimes it even works |
09:54
π
|
Guestiiii |
but then, why couldn't you do that with wpull ? I mean wpull downloads html+js+everything, and when you open the archive in browser, it runs js ? |
09:54
π
|
Flashfire |
Hold Ivan grabsite is a GUI wrapper for Wpull yes? Have you considered making a GUI for snscrape |
09:54
π
|
ivan |
Flashfire: I recommend getting good at the command line |
09:55
π
|
ivan |
command line operations can be composed and automated |
09:55
π
|
ivan |
Guestiiii: that will sometimes work but often not because javascript makes assumptions about paths and headers and origin policy |
09:55
π
|
Guestiiii |
I see |
09:56
π
|
ivan |
Flashfire: for example if you want something to happen after the current command finishes you can use ; or && or just use the typeahead buffer to type the next command |
09:57
π
|
Flashfire |
ivan I find the command line interface confusing to be honest not to mention High Sierra fucks over everything |
09:57
π
|
ivan |
there's no getting around getting good at the command line |
09:57
π
|
kiska |
You have access to that machine thats in LA |
09:57
π
|
kiska |
Try and get familiar with ubuntu |
09:59
π
|
Guestiiii |
(and maybe use fish instead of bash ) |
09:59
π
|
Flashfire |
Alright I might give dual booting a shot once I clear out my 100GB of error files from my desktop |
10:00
π
|
kiska |
Instead of doing ssh -d etc etc, for using that as a netflix proxy. Just do ssh flashfire@server... and mess around with shell |
10:00
π
|
ivan |
man don't tell a shell newbie to use fish |
10:00
π
|
Flashfire |
Dont ask how I got 100GB of error files if someone wants to find out for me be my guest |
10:01
π
|
Guestiiii |
ok :( |
10:01
π
|
ivan |
it's not a POSIX sh and you get screwed if you make it your default on your servers (which shell code do you send over?) |
10:01
π
|
Flashfire |
Wait .... |
10:01
π
|
Guestiiii |
why is that, though? |
10:01
π
|
Guestiiii |
yeah, ok |
10:01
π
|
Guestiiii |
but it's more *friendly* ! |
10:01
π
|
ivan |
you can get a similar-ish experience with zsh-autosuggestions |
10:02
π
|
Guestiiii |
yeah, but I meant the syntax is cleaner, the constructs, etc |
10:02
π
|
Guestiiii |
it's a more reasonable language imo |
10:02
π
|
ivan |
give it time |
10:07
π
|
kiska |
Flashfire: Here is a nice and concise tutorial on Linux and the shell environment http://docs.linuxtone.org/ebooks/Shell/Linux%20Shell%20Scripting%20Tutorial%20v2.0.pdf |
10:09
π
|
Flashfire |
Alright bookmarked that and what kiska sent me will look at it tommorow |
10:10
π
|
kiska |
Have fun! |
10:11
π
|
kiska |
And as I said before, you can't do any damage to the instance since you don't have sudo permissions |
10:11
π
|
Flashfire |
In the mean time I cant get snscrape to run lol |
10:11
π
|
kiska |
xD |
10:12
π
|
Guestiiii |
what's snscrape? |
10:13
π
|
Guestiiii |
ok I see |
10:13
π
|
kiska |
Flashfire on my server just do "pip3 install --user git+https://github.com/JustAnotherArchivist/snscrape.git" |
10:13
π
|
Flashfire |
A Social media scraper made by our own JAA |
10:13
π
|
kiska |
And it should install, then you can use it |
10:14
π
|
Flashfire |
Alright how on that server will I find my successful scrapes? |
10:15
π
|
ivan |
haha you can damage a linux without root |
10:15
π
|
Flashfire |
If Ivan says there is a way through dumb luck I will find it |
10:15
π
|
ivan |
Flashfire: snscrape writes to stdout |
10:15
π
|
ivan |
you can redirect stdout with > file |
10:16
π
|
Flashfire |
Ok so I screwed up with an error already xd |
10:16
π
|
Flashfire |
xD |
10:16
π
|
HCross |
JAA: Brozzler worked.. for about 5 minutes |
10:17
π
|
Flashfire |
See I screwed that up as well |
10:17
π
|
HCross |
until the Chrome process walked off somewhere, and bought the whole thing down |
10:32
π
|
HCross |
yeah, theres something on lolking that Brozzler doesnt like |
10:40
π
|
godane |
latest scan : reference-series-v5i4 |
10:40
π
|
godane |
latest scan : https://archive.org/details/reference-series-v5i4 |
10:41
π
|
Guestiiii |
gtg, thanks for the help! |
10:41
π
|
|
Guestiiii has quit IRC (Remote host closed the connection) |
10:44
π
|
godane |
all of my scans in the last few days : https://www.patreon.com/posts/digitize-scans-21717034 |
11:00
π
|
Flashfire |
Someone scrape https://mobile.twitter.com/marty_balin?lang=en |
11:00
π
|
Flashfire |
he died |
11:20
π
|
eientei95 |
JAA: Can you run your SNScraper on https://twitter.com/marty_balin |
11:25
π
|
eientei95 |
dxrt: When did JAA release their scraper? |
11:25
π
|
kiska |
https://transfer.sh/o7AMJ/@mart_balin_twitter |
11:25
π
|
dxrt |
2-3 weeks ago |
11:26
π
|
eientei95 |
Mind linking it? |
11:27
π
|
kiska |
https://github.com/JustAnotherArchivist/snscrape |
11:27
π
|
eientei95 |
thx |
11:27
π
|
eientei95 |
TIL kiska == dxrt |
11:27
π
|
kiska |
xD |
11:27
π
|
kiska |
In bed so my typing speed is trash |
11:27
π
|
Flashfire |
http://destyy.com A URL shortener |
11:29
π
|
kiska |
Damn my slow typing speed... |
12:14
π
|
|
schbirid has joined #archiveteam-bs |
12:32
π
|
|
BartoCH has quit IRC (Ping timeout: 615 seconds) |
12:35
π
|
godane |
i found out i was missing episode 269 of simply kpop |
12:35
π
|
godane |
i have downloaded it and uploading it now |
12:51
π
|
JAA |
HCross: :-| Well, at least you're making some progress. |
12:51
π
|
HCross |
I ran it twice, both times it hit a URL and then chrome went "nope, fuck off" and died |
12:56
π
|
JAA |
Interesting. Do you know which URL, and can you reproduce it in a normal Chromium instance? |
12:56
π
|
HCross |
ill restart it and see if I can catch it |
12:57
π
|
HCross |
interesting.. restarted it and it seems to have picked up and carried on |
13:10
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
13:13
π
|
HCross |
JAA: urllib3.exceptions.ConnectTimeoutError: (<urllib3.connection.HTTPConnection object at 0x7fd744572c50>, 'Connection to ucs02.engageya.com timed out. (connect timeout=60)') |
13:23
π
|
|
RichardG has joined #archiveteam-bs |
13:30
π
|
JAA |
HCross: Huh, and that crashes Chromium? |
13:30
π
|
HCross |
yes |
13:30
π
|
JAA |
Wow |
13:38
π
|
HCross |
switched to actual proper google chrome |
13:43
π
|
|
BartoCH has joined #archiveteam-bs |
13:48
π
|
|
closure has joined #archiveteam-bs |
14:00
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
14:00
π
|
|
closure_ has joined #archiveteam-bs |
14:32
π
|
|
closure_ has quit IRC (Read error: Connection reset by peer) |
14:33
π
|
|
closure has joined #archiveteam-bs |
14:58
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
14:59
π
|
JAA |
So I just looked a bit into tian.yam.com, and it doesn't look good. |
15:00
π
|
JAA |
There's a sitemap at https://tian.yam.com/site/sitemapindex, but that's definitely incomplete. |
15:00
π
|
|
closure has joined #archiveteam-bs |
15:00
π
|
JAA |
The site relies on JS and doesn't work at all without it. |
15:00
π
|
kiska |
Their "sitemap" contains... a lot of subdomains |
15:00
π
|
JAA |
The posts themselves do though. |
15:01
π
|
JAA |
It seems that each user (?) has its own subdomain. |
15:01
π
|
JAA |
And we only have about 37 hours until the shutdown. |
15:01
π
|
kiska |
"An internal server error occurred." At the end of the sitemap... |
15:02
π
|
kiska |
Not good, the server is already having some issues |
15:03
π
|
JAA |
Oh right, issed that. |
15:03
π
|
JAA |
missed* |
15:03
π
|
kiska |
Here is another sitemap http://member.yam.com/SiteMap/?Type=1 |
15:03
π
|
kiska |
However its in Chinese... |
15:04
π
|
JAA |
Also, if those numbers in the post URLs are post IDs, there are over 200 million posts... |
15:05
π
|
jut_ |
Could a warrior project get through that in a day and a half? |
15:08
π
|
JAA |
If we already had everything set up and their servers can handle our DPoS (and don't ban us), maybe. It's over 1500 requests per second though, so that's quite likely to cause issues (and requires a fair amount of warriors). |
15:11
π
|
kiska |
Another sitemap: https://service.tian.yam.com/sitemaps_posts.xml |
15:11
π
|
kiska |
One more: https://service.tian.yam.com/sitemaps_albums.xml |
15:12
π
|
|
kevinYang has joined #archiveteam-bs |
15:13
π
|
JAA |
Yeah, it seems that there are two sitemaps per subdomain. |
15:14
π
|
kiska |
Let me try a baidu search to discover more sitemaps/subdomains |
15:26
π
|
kiska |
Using site:tian.yam.com on Google yields: "About 1,120,000 results (0.24 seconds) "... |
15:32
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
15:32
π
|
|
closure_ has joined #archiveteam-bs |
15:34
π
|
kiska |
Looks like tian.yam.com is using cloudflare... |
15:44
π
|
|
LogicalDa has joined #archiveteam-bs |
15:46
π
|
LogicalDa |
I'm trying to find out if the Warrior has the arbitrary code execution vulnerabilities that were discovered in Alpine Linux a couple weeks ago https://www.securityweek.com/code-execution-alpine-linux-impacts-containers |
15:51
π
|
kiska |
JAA: Discovered some subdomains using SecurityTrails |
16:01
π
|
|
closure_ has quit IRC (Read error: Connection reset by peer) |
16:04
π
|
|
closure has joined #archiveteam-bs |
16:24
π
|
kiska |
JAA: There seems to be a search feature that we can abuse... |
16:33
π
|
|
closure_ has joined #archiveteam-bs |
16:33
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
16:48
π
|
kiska |
JAA: On kiskaAus I have a FDNS archive, its named 2018-08-24-1535112288-fdns_a.json.gz it may have some tian.yam.com subdomains that we can discover |
16:49
π
|
JAA |
kiska: What's FDNS? |
16:50
π
|
kiska |
JAA: https://opendata.rapid7.com/sonar.fdns_v2/ |
16:50
π
|
kiska |
"This dataset contains the responses to DNS requests for all forward DNS names known by Rapid7's Project Sonar. " |
16:52
π
|
JAA |
Ah, nice. |
16:53
π
|
kiska |
So its on kiskaAus, however I don't know how to access this dataset |
16:54
π
|
kiska |
Ah data is structured like this "{"timestamp":"1535156791","name":"\tusa-stemorganics-com.myshopify.com","type":"cname","value":"shops.myshopify.com"}" |
16:55
π
|
kiska |
I did "zcat 2018-08-24-1535112288-fdns_a.json.gz | grep tian.yam.com" I may have caused the hdd to have a seizure xD |
16:56
π
|
JAA |
Hmm, so not the raw DNS data, I see. |
16:56
π
|
kiska |
Here is the first result of that search "{"timestamp":"1535113806","name":"1204697.tian.yam.com","type":"cname","value":"tian-web-1993807926.ap-northeast-1.elb.amazonaws.com"}" |
16:56
π
|
kiska |
This will take a while... |
16:57
π
|
kiska |
If you ssh into the pipeline, its running on Window 9 |
16:57
π
|
|
closure_ has quit IRC (Read error: Operation timed out) |
16:59
π
|
JAA |
kiska: Hmm, might be a good idea to redirect output to a file... |
16:59
π
|
kiska |
I've ^C it, can you do the redirection? |
17:00
π
|
kiska |
And now we wait for it to read 15GB of data at 15MB/s |
17:00
π
|
JAA |
\o/ |
17:01
π
|
|
closure has joined #archiveteam-bs |
17:01
π
|
kiska |
Note: This data is about 1 month old, but it should still be valid |
17:06
π
|
JAA |
kiska: Should I run it on my server instead? The download would be done in about 12 minutes, and I can probably grep it much faster since the HDD isn't used for much else at the moment. |
17:06
π
|
JAA |
grepping 1 GB takes ~50 seconds here. |
17:06
π
|
JAA |
decompressing and grepping* |
17:07
π
|
kiska |
Its not the hdd, its the CPU |
17:07
π
|
JAA |
Yeah, my CPU isn't used much either at the moment. |
17:07
π
|
kiska |
Neither is mine |
17:07
π
|
kiska |
The process is single threaded, so my X5570 IPC isn't that good |
17:08
π
|
kiska |
But I say let it run, and see what happens |
17:08
π
|
JAA |
Ah right, yeah nevermind then. |
17:09
π
|
JAA |
Yeah, not much slower than on my machine. 1 minute to decompress and grep 1 GB. |
17:11
π
|
kiska |
I am going to say, even if the entire data set is in memory, it wouldn't be faster |
17:13
π
|
JAA |
Yeah, decompression seems to be the bottleneck. |
17:15
π
|
kiska |
So I would say, let it keep doing what its doing, come back in about 15 minutes and see what happens |
17:15
π
|
kiska |
We probably can get a ton of urls to process this way |
17:17
π
|
JAA |
Yeah :-) |
17:17
π
|
kiska |
And for future projects as well |
17:18
π
|
JAA |
kiska: Uh, what does "tail: tiam.yam.com-fdns: file truncated" mean? |
17:18
π
|
JAA |
Window 10 |
17:18
π
|
JAA |
File's empty now. |
17:18
π
|
kiska |
... |
17:19
π
|
kiska |
Oh... I am running multiple ssh sessions... |
17:33
π
|
|
closure_ has joined #archiveteam-bs |
17:33
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
17:35
π
|
kiska |
JAA: It finished, and grabbed less urls than I expected |
17:40
π
|
|
ndiddy has joined #archiveteam-bs |
17:43
π
|
kiska |
Right... I am going to grab a beefier server and see how it processes that gzip |
17:56
π
|
|
Odd0002_ has joined #archiveteam-bs |
17:56
π
|
|
Jens has quit IRC (Read error: Operation timed out) |
17:57
π
|
|
actually_ has quit IRC (Read error: Operation timed out) |
17:57
π
|
|
faolingfa has quit IRC (Read error: Connection reset by peer) |
17:57
π
|
|
Odd0002 has quit IRC (Read error: Connection reset by peer) |
17:57
π
|
|
Odd0002_ is now known as Odd0002 |
17:57
π
|
|
fenn_ has joined #archiveteam-bs |
17:57
π
|
|
robogoat has quit IRC (Read error: Operation timed out) |
17:57
π
|
|
robogoat has joined #archiveteam-bs |
17:58
π
|
|
Muad-Dib has quit IRC (Ping timeout: 260 seconds) |
17:58
π
|
|
pikhq has joined #archiveteam-bs |
17:58
π
|
|
pikhq_ has quit IRC (Read error: Connection reset by peer) |
17:58
π
|
|
closure_ has quit IRC (Read error: Operation timed out) |
17:58
π
|
|
fenn has quit IRC (Read error: Connection reset by peer) |
17:59
π
|
|
obskyr has joined #archiveteam-bs |
17:59
π
|
|
atomicthu has quit IRC (Read error: Connection reset by peer) |
18:00
π
|
|
atomicthu has joined #archiveteam-bs |
18:00
π
|
|
faolingf_ has joined #archiveteam-bs |
18:01
π
|
|
closure has joined #archiveteam-bs |
18:02
π
|
|
omglolbah has quit IRC (Read error: Operation timed out) |
18:02
π
|
|
omglolbah has joined #archiveteam-bs |
18:03
π
|
|
LordNigh2 has joined #archiveteam-bs |
18:04
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 633 seconds) |
18:04
π
|
|
LordNigh2 is now known as Lord_Nigh |
18:05
π
|
|
phuzion has quit IRC (Ping timeout: 633 seconds) |
18:07
π
|
|
thejsa_ has quit IRC (Ping timeout: 633 seconds) |
18:09
π
|
|
jrwr has quit IRC (Ping timeout: 633 seconds) |
18:10
π
|
|
jrwr has joined #archiveteam-bs |
18:10
π
|
|
thejsa has joined #archiveteam-bs |
18:10
π
|
|
faolingfa has joined #archiveteam-bs |
18:11
π
|
|
mundus20- has quit IRC (Ping timeout: 633 seconds) |
18:11
π
|
|
Muad-Dib has joined #archiveteam-bs |
18:11
π
|
|
mundus201 has joined #archiveteam-bs |
18:12
π
|
|
faolingf_ has quit IRC (Ping timeout: 633 seconds) |
18:12
π
|
|
ReimuHaku has quit IRC (Ping timeout: 633 seconds) |
18:17
π
|
|
Darkstar has quit IRC (Read error: Connection reset by peer) |
18:17
π
|
kiska |
JAA: Do you think I can request this data set? https://ant.isi.edu/datasets/requests.html |
18:18
π
|
kiska |
I believe I can get access to their dns dataset |
18:18
π
|
|
Darkstar has joined #archiveteam-bs |
18:20
π
|
|
bztoot has quit IRC (Ping timeout: 633 seconds) |
18:20
π
|
|
t2t2 has joined #archiveteam-bs |
18:21
π
|
|
wp494 has quit IRC (Read error: Operation timed out) |
18:21
π
|
|
wp494 has joined #archiveteam-bs |
18:22
π
|
|
phuzion has joined #archiveteam-bs |
18:26
π
|
|
Jens has joined #archiveteam-bs |
18:27
π
|
|
ReimuHaku has joined #archiveteam-bs |
18:29
π
|
|
ndiddy has quit IRC (Read error: Operation timed out) |
18:33
π
|
|
schbirid2 has joined #archiveteam-bs |
18:34
π
|
|
closure has quit IRC (Read error: Operation timed out) |
18:37
π
|
|
squires has quit IRC (Read error: Operation timed out) |
18:37
π
|
|
djsundog has quit IRC (Read error: Operation timed out) |
18:37
π
|
|
dxrt_ has quit IRC (Read error: Operation timed out) |
18:37
π
|
|
closure_ has joined #archiveteam-bs |
18:37
π
|
|
nightpool has quit IRC (Read error: Operation timed out) |
18:38
π
|
|
nightpool has joined #archiveteam-bs |
18:38
π
|
|
sep332 has quit IRC (Read error: Operation timed out) |
18:38
π
|
|
LogicalDa has quit IRC (Read error: Operation timed out) |
18:38
π
|
|
schbirid has quit IRC (Read error: Operation timed out) |
18:38
π
|
|
Albardin has quit IRC (Read error: Operation timed out) |
18:38
π
|
|
RedType has quit IRC (Read error: Operation timed out) |
18:39
π
|
|
beardicus has quit IRC (Read error: Operation timed out) |
18:39
π
|
|
ivan has quit IRC (Read error: Operation timed out) |
18:40
π
|
|
ivan has joined #archiveteam-bs |
18:43
π
|
|
kiska1 has quit IRC (Read error: Operation timed out) |
18:43
π
|
|
logchfoo1 has quit IRC (Ping timeout: 601 seconds) |
18:44
π
|
|
logchfoo2 starts logging #archiveteam-bs at Sat Sep 29 18:44:54 2018 |
18:44
π
|
|
logchfoo2 has joined #archiveteam-bs |
18:45
π
|
|
LogicalDa has joined #archiveteam-bs |
18:51
π
|
|
Mayonaise has quit IRC (Ping timeout: 600 seconds) |
18:53
π
|
|
REiN^ has quit IRC (Ping timeout: 600 seconds) |
18:54
π
|
|
unlobito has quit IRC (Ping timeout: 600 seconds) |
18:54
π
|
|
unlobito has joined #archiveteam-bs |
18:54
π
|
|
Mayonaise has joined #archiveteam-bs |
18:58
π
|
|
Albardin has joined #archiveteam-bs |
18:59
π
|
|
closure_ has quit IRC (Read error: Connection reset by peer) |
18:59
π
|
|
closure has joined #archiveteam-bs |
19:03
π
|
|
TigerbotH has quit IRC (Ping timeout: 600 seconds) |
19:04
π
|
|
PotcFdk has quit IRC (Ping timeout: 600 seconds) |
19:04
π
|
|
kiska1 has joined #archiveteam-bs |
19:05
π
|
|
beardicus has joined #archiveteam-bs |
19:05
π
|
|
closure has quit IRC (Read error: Operation timed out) |
19:06
π
|
|
zino_ is now known as zino |
19:12
π
|
JAA |
kiska: That looks interesting. I'll need to take a closer look at it later. |
19:13
π
|
|
ndiddy has joined #archiveteam-bs |
19:14
π
|
|
PhrackD has quit IRC (Ping timeout: 600 seconds) |
19:23
π
|
|
schbirid2 has quit IRC (Remote host closed the connection) |
19:24
π
|
|
REiN^ has joined #archiveteam-bs |
19:24
π
|
|
TigerbotH has joined #archiveteam-bs |
19:24
π
|
|
sep332 has joined #archiveteam-bs |
19:25
π
|
|
C4K3 has joined #archiveteam-bs |
19:26
π
|
|
closure has joined #archiveteam-bs |
19:27
π
|
|
squires has joined #archiveteam-bs |
19:28
π
|
|
PhrackD has joined #archiveteam-bs |
19:28
π
|
|
PotcFdk has joined #archiveteam-bs |
19:29
π
|
|
djsundog has joined #archiveteam-bs |
19:29
π
|
|
dxrt_ has joined #archiveteam-bs |
19:30
π
|
|
RedType has joined #archiveteam-bs |
19:31
π
|
|
closure has quit IRC (Read error: Operation timed out) |
19:33
π
|
|
closure has joined #archiveteam-bs |
20:00
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
20:00
π
|
|
closure_ has joined #archiveteam-bs |
20:10
π
|
godane |
so i won that ebay auction |
20:33
π
|
|
closure_ has quit IRC (Ping timeout: 268 seconds) |
21:03
π
|
|
closure has joined #archiveteam-bs |
21:32
π
|
|
closure has quit IRC (Read error: Operation timed out) |
21:36
π
|
|
closure_ has joined #archiveteam-bs |
21:37
π
|
|
n00b227 has joined #archiveteam-bs |
21:37
π
|
|
n00b227 has quit IRC (Client Quit) |
21:39
π
|
|
n00b985 has joined #archiveteam-bs |
21:39
π
|
n00b985 |
Not specifically for archiveteam, but how would I add files to an existing archive.org item with the cli? |
21:44
π
|
JAA |
n00b985: Simpy 'ia upload identifier files' as on the first upload/item creation. Only works if you have access to that item, obviously. |
21:52
π
|
n00b985 |
I tried that, however after uploading the item didn't display on the website (I waited a few hours), nor with ia list. Only the originally uploaded file existed. |
21:59
π
|
JAA |
n00b985: Hmm, it can take a while if there's a derive currently running, but usually that should notice that you've uploaded new files and abort, I think. What's the item? (Feel free to PM if you don't want to share it publicly.) |
22:00
π
|
|
closure_ has quit IRC (Read error: Operation timed out) |
22:00
π
|
n00b985 |
There is a derive already running since it's a video. Will the additional files appear after the first derive is done? |
22:01
π
|
|
closure has joined #archiveteam-bs |
22:02
π
|
JAA |
Yeah, I think so. |
22:07
π
|
n00b985 |
Ok, thanks |
22:23
π
|
|
VerifiedJ has quit IRC (Read error: Operation timed out) |
22:33
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
22:35
π
|
|
closure has joined #archiveteam-bs |
22:59
π
|
|
closure has quit IRC (Read error: Connection reset by peer) |
23:06
π
|
|
closure has joined #archiveteam-bs |
23:23
π
|
|
sknebel_ has joined #archiveteam-bs |
23:25
π
|
|
bakJAA_ has joined #archiveteam-bs |
23:25
π
|
|
swebb sets mode: +o bakJAA_ |
23:25
π
|
|
JAA sets mode: +o bakJAA_ |
23:26
π
|
|
sknebel has quit IRC (Ping timeout: 492 seconds) |
23:28
π
|
|
bakJAA has quit IRC (Ping timeout: 492 seconds) |
23:31
π
|
|
closure has quit IRC (Read error: Operation timed out) |
23:32
π
|
|
BlueMax has joined #archiveteam-bs |
23:33
π
|
|
n00b985 has quit IRC (Quit: Page closed) |
23:33
π
|
|
closure has joined #archiveteam-bs |
23:39
π
|
|
mgrytbak has quit IRC (Ping timeout: 492 seconds) |
23:40
π
|
|
mgrytbak_ has joined #archiveteam-bs |
23:48
π
|
|
mgrytbak_ has quit IRC (Ping timeout: 492 seconds) |
23:54
π
|
|
mgrytbak^ has joined #archiveteam-bs |
23:55
π
|
|
Atom-- has quit IRC (Read error: Connection reset by peer) |
23:56
π
|
|
faolingfa has quit IRC (Read error: Connection reset by peer) |
23:56
π
|
|
faolingfa has joined #archiveteam-bs |
23:56
π
|
|
Atom-- has joined #archiveteam-bs |
23:57
π
|
|
dxrt has quit IRC (Quit: ZNC - http://znc.sourceforge.net) |
23:57
π
|
|
dxrt has joined #archiveteam-bs |
23:57
π
|
|
hook54321 has quit IRC (Ping timeout: 252 seconds) |
23:58
π
|
|
closure has quit IRC (Read error: Operation timed out) |
23:59
π
|
|
Frogging has quit IRC (Ping timeout: 252 seconds) |
23:59
π
|
|
i0npulse has quit IRC (Ping timeout: 252 seconds) |