[R] Beyond Prediction – Text Representation for Social Science (arxiv 2603.10130)

A perspective paper on something I think ML/NLP does not discuss enough: representations that are good for prediction are not necessarily good for measurement. In computational social science and psychology, that distinction matters a lot.

The paper frames this as a prediction–measurement gap and discusses what text representations would need to look like if we treated them as scientific instruments rather than just features for downstream tasks. It also compares static vs contextual representations from that perspective and sketches a measurement-oriented research agenda.

submitted by /u/Hub_Pli
[link] [comments]

Liked Liked