This post was originally part of the DataRobot Community. W-shingling is a popular and straightforward technique for text mining. It can be used for classification, clustering, and other problems. It was developed and optimized by Andrei Zary Broder, a distinguished scientist at Google. …
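To make the idea concrete, here is a minimal sketch of w-shingling in Python. A document's w-shingling is the set of unique runs of w consecutive tokens, and two documents are commonly compared by the Jaccard ratio of their shingle sets. The function names `shingles` and `resemblance`, the lowercase whitespace tokenization, and the default `w=3` are illustrative assumptions, not something taken from the original post.

```python
def shingles(text, w=3):
    """Return the set of unique w-token shingles of a text.

    Assumption: tokens are lowercase, whitespace-separated words.
    """
    tokens = text.lower().split()
    # Slide a window of w tokens over the text; the set drops repeats.
    return {tuple(tokens[i:i + w]) for i in range(len(tokens) - w + 1)}


def resemblance(a, b, w=3):
    """Jaccard similarity of the two texts' shingle sets (0.0 to 1.0)."""
    sa, sb = shingles(a, w), shingles(b, w)
    if not sa and not sb:
        # Both texts are shorter than w tokens; returning 0.0 is a
        # convention chosen for this sketch.
        return 0.0
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    # Eight tokens give five windows of size 4, three of them unique.
    print(len(shingles("a rose is a rose is a rose", w=4)))  # 3
    # Two of the four distinct 2-shingles are shared.
    print(resemblance("the quick brown fox", "the quick brown dog", w=2))  # 0.5
```

A larger `w` makes the comparison stricter, since longer runs of words must match exactly; a smaller `w` is more tolerant of local edits.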