Ingrid Zukerman scite author profile

Abstract. We describe several Markov models derived from the behaviour patterns of many users, which predict which documents a user is likely to request next. We then present comparative results of the predictive accuracy of the different models, and, based on these results, build hybrid models which combine the individual models in different ways. These hybrid models generally have a greater predictive accuracy than the individual models. The best models will be incorporated in a system for pre-sending WWW documents.

show abstract

Authorship Attribution with Topic Models

Seroussi

Zukerman

Bohnert

2014

Computational Linguistics

View full text Add to dashboard Cite

Authorship attribution deals with identifying the authors of anonymous texts. Traditionally, research in this field has focused on formal texts, such as essays and novels, but recently more attention has been given to texts generated by on-line users, such as e-mails and blogs. Authorship attribution of such on-line texts is a more challenging task than traditional authorship attribution, because such texts tend to be short, and the number of candidate authors is often larger than in traditional settings. We address this challenge by using topic models to obtain author representations. In addition to exploring novel ways of applying two popular topic models to this task, we test our new model that projects authors and documents to two disjoint topic spaces. Utilizing our model in authorship attribution yields state-of-the-art performance on several data sets, containing either formal texts written by a few authors or informal texts generated by tens to thousands of on-line users. We also present experimental results that demonstrate the applicability of topical author representations to two other problems: inferring the sentiment polarity of texts, and predicting the ratings that users would give to items such as movies.

show abstract

Personalised rating prediction for new users using latent factor models

Seroussi

Bohnert

Zukerman

2011

View full text Add to dashboard Cite

Lexical query paraphrasing for document retrieval

Zukerman

Raskutti

2002

View full text Add to dashboard Cite

We describe a mechanism for the generation of lexical paraphrases of queries posed to an Internet resource. These paraphrases are generated using WordNet and part-of-speech information to propose synonyms for the content words in the queries. Statistical information, obtained from a corpus, is then used to rank the paraphrases. We evaluated our mechanism using 404 queries whose answers reside in the LA Times subset of the TREC-9 corpus. There was a 14% improvement in performance when paraphrases were used for document retrieval.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ingrid Zukerman

Towards a Bayesian Model for Keyhole Plan Recognition in Large Domains

Predicting Users’ Requests on the WWW

Authorship Attribution with Topic Models

Personalised rating prediction for new users using latent factor models

Lexical query paraphrasing for document retrieval

Contact Info

Product

Resources

About