#archiveteam-bs 2017-12-02,Sat

↑back Search ←Prev date Next date→ (Showing only urls - See all)(Click on time to show url line in full context)

WhoWhatWhen
ola_norski feel naughty doing curl requests to https://web.archive.org/save/https://twitter.com/hashtag/netneutrality?f=tweets , currently every 3rd minute :/ [00:07]
JAAhttps://medium.com/vidme/goodbye-for-now-120b40becafa [00:16]
ola_norskhttps://archive.org/details/jscott_geocities [00:21]
JAAhttps://docs.vid.me/#api-Videos-List [00:38]
http://www.supersimplestorageservice.com/ [00:43]
dashcloud@ola_norsk If you're interested in how to make something be emulated on IA, here's some pages that lay it out for you- http://digitize.archiveteam.org/index.php/Internet_Archive_Emulation http://digitize.archiveteam.org/index.php/Making_Software_Emulate_on_IA [00:47]
JAAI can't find any information about API rate limits, except this Reddit thread: https://redd.it/6acvg5 [01:09]
ola_norsk"The Internet is Living on Borrowed Time" .. https://vid.me/1LriY (ironically on vid.me) ..That's pretty dark title, for being Lunduke :d [01:26]
JAATo be fair, it's also available on YouTube: https://www.youtube.com/watch?v=1VD_pJOFnZ0 [01:33]
ola_norskranma: "German INVASION"...100k creators..https://vid.me/JjNaH [02:13]
ranmajust noticed archivebot slurped down https://ftp.modland.com/ [02:22]
ola_norskdd -i http://google.com -o http://bing.com [02:23]
ranma<Major> Muad-Dib: Your job for https://ftp.modland.com/ has finished. [02:23]
ezfor anyone wanting to mirror vid.me, its possible to page everything there: https://api.vid.me/videos/list?minVideoId=100&maxVideoId=1000 [02:34]
CoolCanuk..... https://usercontent.irccloud-cdn.com/file/PZalOsZ6/image.png [02:34]
bithippoSuch as https://github.com/jjjake/internetarchive ? [02:40]
MrRadarX-posting from #archiveteam: if you're using youtube-dl to grab vid.me content, be aware of this issue: https://github.com/rg3/youtube-dl/issues/14199 [03:47]
wp494posting highlights of https://www.youtube.com/watch?v=KMaWSinw4MI&t=41m33s here [03:54]
FroggingI'm reading this https://np.reddit.com/r/bapcsalescanada/comments/77h771/for_anyone_that_purchased_a_8700k_from_ncix/domm2ca/?context=3 [03:57]
wp494https://vid.me/media [05:42]
CoolCanukhttp://tracker.archiveteam.org/ [05:51]
https://upload.wikimedia.org/wikipedia/commons/3/35/Tux.svg [05:57]
wp494see how the miiverse logo goes a bit out of its bounds and pushes content downwards: https://i.imgur.com/P3Wcfbp.png [06:03]
now take the version of the steam icon we had stored on the wiki and stuffed into the project code (http://www.archiveteam.org/images/4/48/Steam_Icon_2014.png) and it wound up being a bit worse than that example
luckily a 100px version that mediawiki gracefully generated more or less solved things: https://github.com/ArchiveTeam/spuf-grab/pull/2/commits/1c319d3d144cc13599f1fe571e699ca8b3d79e60
[06:04]
note how it looks like it's fine on http://tracker.archiveteam.org/ [06:05]
schbiridhttps://www.hetzner.com/sb [13:43]
odemghttps://medium.com/vidme/goodbye-for-now-120b40becafa
https://medium.com/vidme/goodbye-for-now-120b40becafa
https://medium.com/vidme/goodbye-for-now-120b40becafa
[13:49]
shindakundon't know if it will help but i was made a brute force video/metadata downloader for vidme https://github.com/shindakun/vidme i don't really have the bandwidth or storage to let it run though [17:06]
ola_norskmade a test C64/dosbox emulator item (https://archive.org/details/iaCSS64_test) , but it seems very slow. At least on my potato pc. [17:11]
CoolCanuki'd use something like this http://xmlgrid.net/xml2text.html . then get rid of the non urls in excel/google sheets. [18:14]
ola_norskif you have the links in a list; curl --silent --max-time 120 --connect-timeout 30 'https://web.archive.org/save/THE_LINK_TO_SAVE' > /dev/null , is a way to save them i think [18:15]
idk :d i just use that as cronjobs to save tweets https://pastebin.com/raw/ZE4udKTi [18:19]
https://pastebin.com/raw/dJrVbnpr
that's what i get when running: curl -H "User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Ubuntu Chromium/62.0.3202.89 Chrome/62.0.3202.89 Safari/537.36" --silent --max-time 120 --connect-timeout 30 'https://web.archive.org/save/https://twitter.com/hashtag/netneutrality?f=tweets'
[18:38]
arkiverhttps://archive.org/details/liveweb?sort=-publicdate [19:08]
ola_norskthis is the mail i wrote on the 27th btw: https://pastebin.com/AV1vbKUr [19:15]
opening a capture in the browser does not seem to work to pull the images https://web.archive.org/web/20171130120002/https:/twitter.com/hashtag/netneutrality?f=tweets [19:32]
CoolCanukThis looks like quite the "portfolio" https://en.wikipedia.org/wiki/List_of_radio_stations_owned_by_Cumulus_Media [19:58]
ola_norskCoolCanuk: https://www.marketwatch.com/investing/stock/cmlsq ..Not sure if it's really indicative though [20:02]
CoolCanuk: All i see is the slope going down :d https://www.marketwatch.com/investing/stock/cmlsq/charts That's basically the max of my knowledge about stocks and shit :d [20:06]
CoolCanuk: that cumulus media thing made my brain conjure up some silly idea https://pastebin.com/raw/32k6st0E [20:57]
JAA: i tried this wget command, wget -O /dev/null --header="Accept: text/html" --user-agent="Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:21.0) Gecko/20100101 Firefox/21.0" --quiet --page-requisites "https://web.archive.org/save/https://twitter.com/hashtag/bogus?f=tweets" ..it's 100% quiet, though it doesn't seem to return more than using curl did. [22:00]
JAA: https://pastebin.com/FKu3mHbh this showes the structure of what it does [22:19]
JAAola_norsk: "Not following https://web.archive.org/save/_embed/https://pbs.twimg.com/profile_images/848200666199629824/ZwvxQIzP_bigger.jpg because robots.txt forbids it." [22:30]
ola_norskJAA: here is output from me running the command (Note, it's in norwegian :/ ) https://pastebin.com/awJ9j4D8 [22:36]
--2017-12-02 23:41:17-- https://web.archive.org/save/_embed/https://abs.twimg.com/a/1512085154/css/t1/images/ui-icons_2e83ff_256x240.png [22:44]
JAABut my command earlier grabbed https://pbs.twimg.com/profile_images/848200666199629824/ZwvxQIzP_bigger.jpg for example. [22:45]
My test earlier grabbed https://pbs.twimg.com/media/DQDHMryX4AEseEo.jpg for example, which is an image from a post most likely (though I'm not going to try and figure out which one). [22:55]
Uhm, dafuq? https://web.archive.org/web/20171202231923/https:/twitter.com/hashtag/bogus?f=tweets [23:22]
The command was wget --page-requisites -e robots=off 'https://web.archive.org/save/https://twitter.com/hashtag/bogus?f=tweets' [23:24]

↑back Search ←Prev date Next date→ (Showing only urls - See all)(Click on time to show url line in full context)