Gossip Journalism: big data can use bright dirty data unfit for publication
Missing from my taxonomy of types of data (as summarized in this post) leaves out mention of dirty data. The different types of data have different levels of trustworthiness where bright data is highly trusted, while model-generated data is less trusted witness of reality. My descriptions of the dim data or model-forbidden data is not…