2012 IEEE 28th International Conference on Data Engineering 2012
DOI: 10.1109/icde.2012.149
|View full text |Cite
|
Sign up to set email alerts
|

Earlybird: Real-Time Search at Twitter

Abstract: Abstract-The web today is increasingly characterized by social and real-time signals, which we believe represent two frontiers in information retrieval. In this paper, we present Earlybird, the core retrieval engine that powers Twitter's realtime search service. Although Earlybird builds and maintains inverted indexes like nearly all modern retrieval engines, its index structures differ from those built to support traditional web search. We describe these differences and present the rationale behind our design… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
142
0
1

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 138 publications
(143 citation statements)
references
References 43 publications
0
142
0
1
Order By: Relevance
“…This goes along the way of the system stack starting from logging [18] and machine learning techniques [21] to indexing [5], [7], [42], [43] and designing a SQL-like query language interface [24]. In addition, several efforts have focused on analyzing microblog data, which include semantic and sentiment analysis [3], [28], [30], decision making [6], news extraction [35], event and trend detection [1], [19], [27], [34], [37], understanding the characteristics of microblog posts and search queries [22], [33], microblogs ranking [11], [39], and recommending users to follow or news to read [14], [32].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…This goes along the way of the system stack starting from logging [18] and machine learning techniques [21] to indexing [5], [7], [42], [43] and designing a SQL-like query language interface [24]. In addition, several efforts have focused on analyzing microblog data, which include semantic and sentiment analysis [3], [28], [30], decision making [6], news extraction [35], event and trend detection [1], [19], [27], [34], [37], understanding the characteristics of microblog posts and search queries [22], [33], microblogs ranking [11], [39], and recommending users to follow or news to read [14], [32].…”
Section: Related Workmentioning
confidence: 99%
“…Microblog Search Queries. Real-time search on microblogs often refers to keyword search [5], [7], [42], [43]. The difference of one technique over the other is mainly in the query type, accuracy, ranking function, and memory management.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Earlybird [2] is the proprietary distributed system developed by Twitter to overcome the practical system issues inside Twitter such as inverted index organization and concurrent control. The most recent academic work on real-time microblog indexing and search is Pollux [17] which is also a distributed system aiming to solve the concerns of fault tolerance and global storage effectiveness.…”
Section: B Microblog and Social Media Data Management In Generalmentioning
confidence: 99%
“…They are not suitable for searching short-sized social media data that are characterized by few keywords. Twitter itself also offers a real-time 1 http://blog.twitter.com/2011/02/superbowl.html search service 2 , which returns highly-ranked tweets in response to user-input keywords. However, the spatial aspect is not handled in the search service.…”
Section: Introductionmentioning
confidence: 99%