Byron J. Gao
Office: Nueces 225 / Email: / Phone: (512) 245-0348 / Fax: (512) 245-8750
Department of Computer Science, Texas State University, 601 University Drive, San Marcos, TX 78666

Byron J. Gao received his Ph.D. and B.Sc. in Computer Science from Simon Fraser University in 2007 and 2003, respectively. He was a postdoctoral fellow at the University of Wisconsin before joining Texas State University in 2008. His research spans several related fields: data mining, databases, information retrieval, and bioinformatics.

Spring 2014:
CS 4378U: Introduction to Data Mining and Information Retrieval
CS 5369U: Data Mining and Information Retrieval

Fall 2014:
CS 3378: Theory of Automata
CS 5338: Formal Languages (Please note that I do not process override requests; I only teach.)

Recent publications:
* E-Tree: An Efficient Indexing Structure for Ensemble Models on Data Streams. IEEE Transactions on Knowledge and Data Engineering (TKDE).
* VSRank: A Novel Framework for Ranking-based Collaborative Filtering. ACM Transactions on Intelligent Systems and Technology (TIST).
Other publications: DBLP
Links: Lab, Group, REUIR, REUIOT
Misc: Best-of-SDM'06 research paper, Best research paper candidate in KDD'07, Erdos number 3 via P. Erdos -> L. Babai -> J. Cai -> me :-)

Professional services
PC member: ECML/PKDD'14, ICDM'14, WWW'14 (demo track), ICDM'13, ICDM'12, SDM'12, ICDM'11, ECML/PKDD'11, SIGKDD'11, SDM'11, SIGKDD'10
Reviewer: TKDE, TKDD, DMKD, TEVC, KAIS, DKE, Discrete Applied Mathematics, Computational Intelligence, Applied Intelligence, TSMCB, ESWA

To potential students
- For thesis: In general, you need to take a course/independent study with me first. Check out the following keywords and see if they resonate with you:
data mining, databases, information retrieval, web search, exploratory search, faceted search, clustering, classification, machine learning, bioinformatics, Internet of Things, social networks, information extraction, opinion mining, recommendation systems, community information management, personal information management, Wikipedia, mass collaboration, Craigslist, gene expression analysis, statistics, discrete optimization, web services, LAMP, Hadoop ...

- For independent study: I'm always interested in working with undergraduate and graduate students on fun projects. Check out the following examples:
ClusteringWiki, MASFA