Welcome to an experiment!
You mission, should you choose to accept it, is to make the best lithology prediction you can. We want you to try to beat the accuracy score Brendon Hall achieved in his Geophyscial Tutorial (TLE, October 2016).
First, read the open access tutorial by Brendon in the October issue of The Leading Edge.
Here's the text of that box again:
I hope you enjoyed this month's tutorial. It picks up on a recent wave of interest in artificial intelligence approaches to prediction. I love that Brendon shows how approachable the techniques are — the core part of the process only amounts to a few dozen lines of fairly readable Python code. All the tools are free and open source, it's just a matter of playing with them and learning a bit about data science.
In the blind test, Brendon's model achieves an accuracy of 43% with exact facies. We think the readers of this column can beat this — and we invite you to have a go. The repository at github.com/seg/2016-ml-contest contains everything you need to get started, including the data and Brendon's code. We invite you to find a friend or two or more, and have a go!
To participate, fork that repo, and add a directory for your own solution, naming it after your team. You can make pull requests with your contributions, which must be written in Python, R, or Julia. We'll run them against the blind well — the same one Brendon used in the article — and update the leaderboard. You can submit solutions as often as you like. We'll close the contest at 23:59 UT on 31 January 2017. There will be a goody bag of completely awesome and highly desirable prizes for whoever is at the top of the leaderboard when the dust settles. The full rules are in the repo.
Have fun with it, and good luck!
All the code and data to reproduce everything in that article is right here in this repository. You can read the code in a Jupyter Notebook here...
See the February issue of The Leading Edge for Matt Hall's user guide to the tutorials; it explains how to run a Jupyter Notebook.
See Running the notebook live (below) for information on running that noteobook live right now this minute in your web browser.
To make it even easier to try machine learning for yourself, you can launch this notebook on mybinder.org and run it there. You can load the data, change the code, and do everything... except enter the contest. Everything on your mybinder.org machine is temporary. If you make something awesome, be sure to use File > Download as... > Notebook (.ipynb) to save it locally. Then you can fork the repo in GitHub, add your new notebook, and make your pull request.
We've never done anything like this before, so there's a good chance these rules will become clearer as we go. We aim to be fair at all times, and reserve the right to make judgment calls for dealing with unforeseen circumstances.
© 2016 SEG, licensed CC-BY, please share this!