📝 Add suggestion to fix ML model and encourage data gathering for metric impact in Kaggle-style competition

davidgasquez.com / handbook

fork atom

📚 Personal bits of knowledge

fork atom

davidgasquez.com 7 months ago e6b5bd09 3c1ee290

1 changed file

expand all

unified split

Ideas.md

··· 103 103 - A distance metric is used to measure accuracy and value of the submitted datasets. 104 104 - With enough participants, the final dataset could be a function of the best submissions (e.g: a vote where each row is decided as the most common submitted rows from top participants or the average between them) 105 105 - Could also be another twist on Kaggle, mixing the ML competition aspects with Prediction Markets (e.g: polymarket). 106 + - [Fix the ML model, encourage participants to gather the data and measure based on the impact on the final metric](https://www.dataperf.org/training-set-acquisition). 106 107 - Similar to Numerai, participants send submissions and stake some amount of money. 107 108 - The best submissions are selected and the money is distributed among the participants depending on their stake. 108 109 - For each task, rewards are given per row/prediction, not per model. This encourages a [[Plurality]] of models to be used that specialize in different tasks.