Continuing with the Ads.txt crawler has lead to the idea to store the crawler results in a database and have them available from a web site. This post introduces the first pass as such a site with the source code available in the following repository:
As a quick review the Ads.txt standard is one where publishers can host a simple text file with the names of authorized ad networks that have permission to sell the publisher's inventory. There is a reference Python crawler for such files and I've built a crawler in Clojure as an alternative. See this link for a series of posts about the Ads.txt specification and the development of the crawler. The crawler project is here.
Running a free Heroku app which falls asleep? You can use a service to ping it periodically to keep it alive.
Here is one:
So, it looks like the CoinMarketCap has changed their site a bit and it breaks the coinmarketcap.py script I wrote back in July. Many thanks to cool007zqw for pointing this out by adding an issue on the repo.
On investigation it looks like CoinMarketCap changed the url to the page which the script was developed for.
The new page is as follows:
Heroku will only deploy code pushed to the master branch of your heroku remote. Knowing this as the only method to push code to your remote is limiting if you want to deploy from a feature branch.
Luckily, Heroku does support pushing a non-master branch with simple change of syntax.
$ git push heroku feature/my-cool-feature:master
Simply tack on the :master and you can push your feature branch or any other branch for that matter.
If you've found this post and are rushing to upgrade a previously installed version of geth on Linux before the hard fork on October 16th here are a quick set of instructions.
The instructions assume you've built your geth from source. If that is the case the following should be helpful.
Make sure to stop your current geth instance. I used the following command:
The command "lein deploy clojars" returned the following error.
Could not sign /Users/brad/work/github/ads-txt-crawler/target/ads-txt-crawler-0.0.6.jar Cannot run program "gpg": error=20, Not a directory See `lein help gpg` for how to set up gpg.
Being on a Mac I used to install gpg