Using an existing HIV-related social media dataset that had been coded by an HIV domain expert, the researchers tested whether four commonly used machine learning methods could learn the patterns associated with HIV risk behavior. They used 10-fold cross-validation to measure how quickly and accurately each model applied that knowledge to detect HIV content in social media data.
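The evaluation procedure described above can be illustrated with a minimal sketch of 10-fold cross-validation that records both accuracy and elapsed time per model. Everything in it is an assumption made for illustration: the summary does not name the four methods, so a majority-class baseline and a small naive Bayes classifier stand in for them, and the posts and labels are synthetic placeholders rather than the expert-coded dataset.

```python
import math
import random
import time
from collections import Counter, defaultdict


def k_fold_indices(n, k=10, seed=0):
    """Shuffle indices 0..n-1 and deal them into k disjoint folds."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


class MajorityClassifier:
    """Hypothetical baseline: always predicts the most common training label."""

    def fit(self, texts, labels):
        self.label = Counter(labels).most_common(1)[0][0]
        return self

    def predict(self, texts):
        return [self.label for _ in texts]


class NaiveBayesClassifier:
    """Hypothetical stand-in: multinomial naive Bayes with add-one smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def _log_score(self, text, label):
        total_docs = sum(self.class_counts.values())
        denom = sum(self.word_counts[label].values()) + len(self.vocab)
        score = math.log(self.class_counts[label] / total_docs)
        for word in text.lower().split():
            score += math.log((self.word_counts[label][word] + 1) / denom)
        return score

    def predict(self, texts):
        return [max(self.class_counts, key=lambda y: self._log_score(t, y))
                for t in texts]


def cross_validate(make_model, texts, labels, k=10, seed=0):
    """Return (accuracy, seconds): each fold is held out once for testing."""
    correct = 0
    start = time.perf_counter()
    for held_out in k_fold_indices(len(texts), k, seed):
        held = set(held_out)
        train = [i for i in range(len(texts)) if i not in held]
        model = make_model().fit([texts[i] for i in train],
                                 [labels[i] for i in train])
        predictions = model.predict([texts[i] for i in held_out])
        correct += sum(p == labels[i] for p, i in zip(predictions, held_out))
    return correct / len(texts), time.perf_counter() - start


# Synthetic placeholder posts (assumption): label 1 = risk-related, 0 = other.
texts = [f"risk_word party post{i}" for i in range(10)] + \
        [f"weather coffee post{i}" for i in range(10)]
labels = [1] * 10 + [0] * 10

models = {"majority": MajorityClassifier, "naive_bayes": NaiveBayesClassifier}
results = {name: cross_validate(cls, texts, labels) for name, cls in models.items()}
for name, (accuracy, seconds) in results.items():
    print(f"{name}: accuracy={accuracy:.2f}, time={seconds:.4f}s")
```

Each post is used for testing exactly once and for training in the other nine folds, so the accuracy figure reflects performance on data the model did not see while fitting. Timing the whole loop gives a rough speed comparison between models under the same splits.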
Machine learning can enable social big data to become a new and important tool in HIV research.