Finish inference script
I am finishing the inference script right now. Encountering several bugs, though...
- Debug cluster-based expected distributions
- Debug predict_with_inchikey
- Check what happens with the performance (remove 1 true positive)
- Tune alpha parameter
- Design a good validation set
- Run predictions and double-check with oriol that the web-checker doesn't do anything funny with it