Recruiting people from diverse backgrounds to participate in health research requires intentional and culture-driven strategic efforts. In this study, we utilize publicly available Twitter posts to identify targeted populations to recruit for our HIV prevention study. Natural language processing methods were used to find self-declarations of ethnicity, gender, and age group, and advanced classification methods to find sexually-explicit language. Using the official Twitter API and the available tools, Demographer and M3, we identified 4800 users who were likely young Black or Hispanic men living in Los Angeles from an initial collection of 47.4 million tweets posted over 8 months. Despite a limited precision, our results suggest that it is possible to automatically identify users based on their demographic attributes and characterize their language on Twitter for enrollment into epidemiological studies.