Monday, October 30, 2006
Netflix & Rasch partial-credit-model
Modeling each of the 17770 movies in the Netflix database to have its own rating scale (the Rasch-Masters partial-credit model) produces worse predictions than modeling them all to share the same rating scale! And modeling each of the 480,189 customers to have a unique rating scale is even worse!!
This accords with the Rasch proposition that "betterdescription of the local dataset can result in worse inference for other data sets." I was already skeptical of the accidental nature of many partial credit analyses, particularly those with low category frequencies. The Netflix data confirm my skepticism.
This accords with the Rasch proposition that "betterdescription of the local dataset can result in worse inference for other data sets." I was already skeptical of the accidental nature of many partial credit analyses, particularly those with low category frequencies. The Netflix data confirm my skepticism.
Comments:
<< Home
Hi Mike,
Matt Barney - former student of yours here. Curious if this Netflix study has a citation, or was it a private project you were doing for them?
Post a Comment
Matt Barney - former student of yours here. Curious if this Netflix study has a citation, or was it a private project you were doing for them?
<< Home