You might enjoy the continuing chaos...
(https://vendire-ludorum.blogspot.com/20 ... ippos.html)


I'm thinking logistic regression. I'm really interested to see what the contributing factors are to a horse winning a race, but it seems difficult to nail down.
I wouldnt necessarily say its difficult to nail down the attributes as such. You have the ones you mention above but also things like race class, rating, ground, etc. The difficulty comes with knowing how to weight the attributes in regards to the race that day. Different things matter depending on the setup of that race. You may have heavy ground which is going to have more of an effect on how horses will perform than say good to soft which is more optimum for most. So you would need to take that into account. Rating has most use in handicaps where it dictates how the horses are weighted. Something dropping down in mark might have reached something workable for today's contest or may just be falling off a cliff. Trip is another one where you have specialist distances like 7f, where previous form at this trip would be beneficial. I think this is why its so difficult to gauge what the odds should be of a given animal in a race.Euler wrote: ↑Fri Apr 10, 2026 9:33 amI'm thinking logistic regression. I'm really interested to see what the contributing factors are to a horse winning a race, but it seems difficult to nail down.
I really understand in play incredibly well now, but pre-off I'm sort of curious as to whether you can infer what the odds should be, given form, lines, progeny, trainer, jockey, course, distance, and so on. It's long been a target of mine to really nail that down.
I know to about a billion decimal places what the price should be in play, but it's more about standard handicapping. Trying to understand how the market is being priced and why.
Good post. That's as good as any out there.
Thats where I struggle. Including the market without just becoming almost a mirror of it. Weighting it too little renders it almost redundant as the other attributes over power it. Weighting it too highly makes my implied probabilities line up closely with it. Guess its finding that middle ground which I havent been able to do yet programmatically. I take what my model spits out and do that manually currently adjusting them with what the market is saying.wearthefoxhat wrote: ↑Fri Apr 10, 2026 11:17 amGood post. That's as good as any out there.
Bill Benter/Alan Woods built their database(s) and drilled down to an Nth degree on each variable. TBH, I don't ever think their success could be replicated, however, through testing, it's possible a winning data model could be constructed.
A sort of rating/tissue/value comparison is then made on all runners, back all those that qualify, or, lay the most obvious ones out of line on the prices.
Nowadays, the books are a bit lazy pre-off, safe in the knowledge they can restrict a winning punter at any time, for any reason. Also, their line into the stables knowledge and betting behaviour is not like it used to be.
If approaching this from a database angle, don't forget to include the live market into the model. BB learned that early lesson, when he did...well the rest is history.