This data set is the UCI: SMS Spam Collection data set formatted for use with the NeuNetS tool. For text, NeuNetS requires that all of the text be in one file named "train.tsv", with each line in the following format:
- <text><tab><class-name>
The SMS Spam Collection is a set of SMS tagged messages that have been collected for SMS Spam research. It contains one set of SMS messages in English of 5,574 messages, tagged acording being ham (legitimate) or spam. This data set is sourced from the UCI Machine Learning Repository.
Citation: Almeida, T.A., Gómez Hidalgo, J.M., Silva, T.P. Towards SMS Spam Filtering: Results under a New Dataset. International Journal of Information Security Science (IJISS), 2(1), 1-18, 2013. For more information, see https://archive.ics.uci.edu/dataset/228/sms+spam+collection