|
- Faster Learned Sparse Retrieval with Block-Max Pruning.
A. Mallia, T. Suel, and N. Tonellotto.
46th International ACM SIGIR Conference on Research and Development in Information Retrieval,
July 2024.
PDF
- Faster Learned Sparse Retrieval with Guided Traversal.
A. Mallia, J. Mackenzie, T. Suel, and N. Tonellotto.
44th International ACM SIGIR Conference on Research and Development in Information Retrieval,
July 2022.
PDF
- Using Conjunctions for Faster Disjunctive Top-k Queries.
M. Siedlaczek, A. Mallia, and T. Suel.
15th ACM International Conference on Web Search and Data Mining,
March 2022.
PDF
- Optimizing Iterative Algorithms for Social Network Sharding.
Z. Deng and T. Suel.
IEEE International Conference on Big Data, December 2021.
PDF
- Report on the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval.
C. Shah, T. Suel, F. Diaz et al.
SIGIR Record, December 2021.
PDF
- Learning Passage Impacts for Inverted Indexes.
A. Mallia, O. Khattab, T. Suel, and N. Tonellotto.
44th International ACM SIGIR Conference on Research and Development in Information Retrieval,
July 2021.
PDF
- Fast Disjunctive Candidate Generation Using Live Block Filtering.
A. Mallia, M. Siedlaczek, and T. Suel.
14th ACM International Conference on Web Search and Data Mining,
March 2021.
PDF
- Feature Extraction for Large-Scale Text Collections.
L. Gallagher, A. Mallia, S. Culpepper, T. Suel, and B. Cambazoglu.
29th International Conference on Information and Knowledge Engineering,
November 2020.
PDF
- A Comparison of Top-k Threshold Estimation Techniques for
Disjunctive Query Processing.
S. Siedlaczek, A. Mallia, T. Suel, and M. Sun.
29th International Conference on Information and Knowledge Engineering,
November 2020.
PDF
- To index or not to index: Time-space trade-offs for positional ranking
functions in search engines.
Diego Arroyuelo, Senén González, Mauricio Marin, Mauricio Oyarzún, Torsten Suel, and
LuisValenzuela.
Information Systems, November 2019 (accepted for publication).
preprint
- Forward Index Compression for Instance Retrieval in an Augmented Reality
Application.
Qi Wang, Michał Siedlaczek, Yen-Yu Chen, Michael Gormish, and Torsten Suel.
IEEE International Conference on Big Data, December 2019.
PDF
- GPU-Accelerated Decoding of Integer Lists.
Antonio Mallia, Michał Siedlaczek, Torsten Suel, and Mohamed Zahran.
28th International Conference on Information and Knowledge Engineering,
November 2019.
PDF
- Document Reordering for Faster Intersection.
Q. Wang and T. Suel.
45th International Conference on Very Large Data Bases,
August 2019.
PDF
- PISA: Performant Indexes and Search for Academia.
Antonio Mallia, Michał Siedlaczek, Joel Mackenzie, and Torsten Suel.
Proceedings of the Open-Source IR Replicability Challenge (OSIRRC 2019),
July 2019.
PDF
- Exploiting Global Impact Ordering for Higher Throughput in Selective Search.
S. Siedlaczek, J. Rodriguez, and T. Suel.
European Conference on Information Retrieval,
April 2019, to appear.
PDF
- Compressing Inverted Indexes with Recursive Graph Bisection: A Reproducibility Study.
J. MacKenzie, A. Mallia, M. Petri, S. Culpepper, and T. Suel.
European Conference on Information Retrieval (Reproducibility Track),
April 2019, to appear.
PDF
- An Experimental Study of Index Compression and DAAT Query Processing Methods.
A. Mallia, S. Siedlaczek, and T. Suel.
European Conference on Information Retrieval (Reproducibility Track),
April 2019, to appear.
PDF
- Exploring Size-Speed Trade-Offs in Static Index Pruning.
J. Rodriguez and T. Suel.
IEEE International Conference on Big Data, December 2018.
PDF
Slides
- Fast Bag-Of-Words Candidate Selection in Content-Based Instance Retrieval Systems.
M. Siedlaczek, Q. Wang, Y. Chen, and T. Suel.
IEEE International Conference on Big Data, December 2018.
PDF
- Delta Compression Techniques.
T. Suel.
In Encyclopedia of Big Data Technologies, Springer, 2018.
PDF
- Improved Methods for Static Index Pruning.
W. Jiang, J. Rodriguez, and T. Suel.
IEEE International Conference on Big Data, December 2016.
PDF
- Efficient Index Updates for Mixed Update and Query Loads.
S. Nepomnyachiy and T. Suel.
IEEE International Conference on Big Data, December 2016.
PDF
- Three-Hop Distance Estimation in Social Graphs.
P. Welke, A. Markowetz, T. Suel and M. Christoforaki.
IEEE International Conference on Big Data, December 2016.
PDF
- What Makes A Group Fail: Modeling Social Group Behavior in Event-Based Social Networks.
X. Liu and T. Suel.
IEEE International Conference on Big Data, December 2016.
PDF
- Fast First-Phase Candidate Generation for Cascading Rankers.
Q. Wang, C. Dimopoulos, and T. Suel.
39th Annual ACM SIGIR Conference, July 2016.
PDF
- Structural Sentence Similarity Estimation for Short Texts.
W. Ma and T. Suel.
29th International FLAIRS Conference, May 2016.
PDF
- Estimating Pairwise Distances in Large Graphs.
M. Christoforaki and T. Suel.
IEEE International Conference on Big Data, October 2014.
PDF
- A Robust Model for Paper-Reviewer Assignment.
X. Liu, T. Suel, and N. Memon.
Proceedings of the ACM Conference on Recommender Systems (RecSys),
October 2014. PDF
- Automated Decision Support for Human Tasks in a Collaborative
System: The Case of Deletion in Wikipedia.
B. Gelley and T. Suel.
Proceedings of WikiSym, August 2013.
PDF
- A Candidate Filtering Mechanism for Fast Top-K Query Processing
on Modern CPUs.
C. Dimopoulos, S. Nepomnyachiy, and T. Suel.
36th Annual ACM SIGIR Conference, July 2013.
PDF
- Optimizing Top-k Document Retrieval Strategies for Block-Max
Indexes.
C. Dimopoulos, S. Nepomnyachiy, and T. Suel.
6th ACM Conference on Web Search and Data Mining, February 2013.
PDF
- Optimizing Positional Index Structures for Versioned Document
Collections.
J. He and T. Suel.
35th Annual ACM SIGIR Conference, July 2012.
PDF
- To Index or not to Index: Time-Space Trade-Offs in Search Engines
with Positional Ranking Functions.
D. Arroyuelo, S. Gonzalez, M. Marin, M. Oyarzun, and T. Suel.
35th Annual ACM SIGIR Conference, July 2012.
PDF
- Text vs. Space: Efficient Geo-Search Query Processing.
M. Christoforaki, J. He, C. Dimopoulos, A. Markowetz, and T. Suel.
20th ACM Conference on Information and Knowledge Management,
October 2011.
PDF
- Scalable Manipulation of Archival Web Graphs.
Y. Avcular and T. Suel.
Workshop on Large-Scale and Distributed Systems for Information
Retrieval. October 2011.
PDF
- Faster Temporal Range Queries over Versioned Text.
J. He and T. Suel.
34th Annual ACM SIGIR Conference, July 2011.
PDF
- Faster Top-k Document Retrieval Using Block-Max Indexes.
S. Ding and T. Suel.
34th Annual ACM SIGIR Conference, July 2011.
PDF
- Batch Query Processing for Web Search Engines.
S. Ding, J. Attenberg, R. Baeza-Yates, and T. Suel.
4th ACM Conference on Web Search and Data Mining, February 2011.
PDF
- Improved Index Compression Techniques for Versioned Document
Collections. With J. He and J. Zeng.
19th ACM Conference on Information and Knowledge Management,
October 2010.
PDF
- Efficient Term Proximity Search with Term-Pair Indexes.
With H. Yan, S. Shi, F. Zhang, and J. Wen.
19th ACM Conference on Information and Knowledge Management,
October 2010.
PDF
- Scalable Techniques for Document Identifier Assignment in
Inverted Indexes.
With S. Ding and J. Attenberg.
19th International World Wide Web Conference (WWW), April 2010.
PDF
- Compact Full-Text Indexing of Versioned Document
Collections.
With J. He and H. Yan.
18th Conference on Information and Knowledge Management,
November 2009. PDF
- Modeling and Predicting User Behavior in Sponsored Search.
With J. Attenberg and S. Pandey.
15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD),
June 2009. PDF
- Compressing Term Positions in Web Indexes.
With H. Yan and S. Ding.
32nd Annual ACM SIGIR Conference, June 2009.
PDF
- Using Graphics Processors for High-Performance IR Query
Processing. With S. Ding, J. He, and H. Yan.
18th International World Wide Web Conference (WWW), April 2009.
PDF
[An earlier shorter version appeared as a poster at the 17th WWW,
April 2008]
- Inverted Index Compression and Query Processing with Optimized
Document Ordering. With H. Yan and S. Ding.
18th International World Wide Web Conference (WWW), April 2009.
PDF
- Improved Techniques for Result Caching in Web Search
Engines. With Q. Gan.
18th International World Wide Web Conference (WWW), April 2009.
PDF
- Top-k Aggregation Using Intersection of Ranked Inputs.
with R. Kumar, K. Punera, and S. Vassilvitskii. Second ACM International
Conference on Web Search and Data Mining (WSDM), February 2009.
PDF
- Cleaning Search Results using Term Distance Features.
With J. Attenberg. 4th Workshop on Adversarial Information Retrieval
on the Web (in conjunction with WWW), April 2008.
PDF
- Geographic Web Usage Estimation by Monitoring DNS Caches.
With H. Akcan and H. Broennimann. 1st International Workshop on Location
and the Web (in conjunction with WWW), April 2008.
PDF
- Analysis of Geographic Queries in a Search Engine Log.
With Q. Gan, J. Attenberg, and A. Markowetz. 1st International Workshop on
Location and the Web (in conjunction with WWW), April 2008.
PDF
- Performance of Compressed Inverted List Caching in Search
Engines. With J. Zhang and X.Long. 17th International
World Wide Web Conference (WWW), April 2008, to appear.
PDF
- Algorithms for Low-Latency Remote File Synchronization.
With H. Yan and U. Irmak. IEEE Infocom Conference, April 2008.
PDF
- Improving Web Spam Classifiers Using Link Structure.
With Q. Gan. 3rd Workshop on Adversarial Information Retrieval
on the Web (held in conjunction with WWW), May 2007, to appear.
PDF
- Efficient Search in Large Textual Collections with Redundancy.
With J. Zhang. 16th International World Wide Web Conference (WWW),
May 2007, to appear.
PDF
- Optimized Inverted List Assignment in Distributed Search Engine
Architectures. With J. Zhang. 21st IEEE International Parallel
& Distributed Processing Symposium (IPDPS'07), March 2007, to appear.
PDF
- Efficient Query Subscription Processing for Prospective Search
Engines. With U. Irmak, S. Mihaylov, S. Ganguly, and R. Izmailov.
USENIX Annual Technical Conference, May 2006.
PDF
- Efficient Query Processing in Geographic Web Search
Engines. With Y. Chen and A. Markowetz. ACM Intern. Conference on
Management of Data (SIGMOD), June 2006.
PDF
- Interactive Wrapper Generation with Minimal User Effort.
With U. Irmak. 15th International World Wide Web
Conference (WWW), May 2006.
PDF
- Efficient Query Evaluation on Large Textual Collections in a
Peer-to-Peer Environment. With J. Zhang. 5th IEEE International
Conference on Peer-to-Peer Computing, August 2005.
PDF
- Design and Implementation of a Geographic Search Engine.
With A. Markowetz, Y. Chen, X. Long, and B. Seeger.
8th International Workshop on the Web and Databases
(WebDB), June 2005. PDF
(Note: an extended version is available as Technical Report
TR-CIS-2005-03, Polytechnic University, February 2005.
PDF)
- Hierarchical Substring Caching for Efficient Content Distribution
to Low-Bandwidth Clients. With U. Irmak. 14th International
World Wide Web Conference (WWW), May 2005.
PDF
- Three-Level Caching for Efficient Query Processing in Large Web
Search Engines. With X. Long. 14th International
World Wide Web Conference (WWW), May 2005.
PDF
- Improved Single-Round Protocols for Remote File Synchronization.
With U. Irmak and S. Mihaylov. IEEE Infocom Conference,
March 2005. PDF
(Note: an earlier version with some of the results appeared at the 4th
New York Metro Area Networking Workshop (NYMAN), September 2004.)
- Optimal Peer Selection for P2P Downloading and Streaming.
With M. Adler, R. Kumar, K. Ross, D. Rubenstein, and D. Yao.
IEEE Infocom Conference, March 2005. PDF
- The Perron-Frobenius Theorem and Some of its Applications.
With U. Pillai and S. Cha. IEEE Signal Processing Magazine 2, 2005, pp.
62-75.
- Approximation Algorithms for Array Partitioning Problems.
With S. Muthukrishnan. Journal of Algorithms 54, 2005, pp. 85-104.
PDF
- Local Methods for Estimating PageRank Values.
With Y. Chen and Q. Gan. 13th Conference on Information and Knowledge
Management (CIKM), November 2004.
PDF
(Note: an earlier version appeared at the 3rd Workshop on Web Dynamics
in conjunction with WWW 2004.)
- Compressing File Collections with a TSP-Based Approach.
With D. Trendafilov and N. Memon. Technical Report TR-CIS-2004-02,
Polytechnic University, April 2004.
PDF
- Improved File Synchronization Techniques for Maintaining Large
Replicated Collections over Slow Networks.
With P. Noel and D. Trendafilov. IEEE International Conference
on Data Engineering (ICDE), March 2004.
PDF
(Talk: PPT
PDF, 6 per page)
- Server-Friendly Delta Compression for Efficient Web Access.
With A. Savant. Eighth International Workshop on Web Content Caching
and Distribution (WCW), September 2003.
PDF
(Talk: PPT
PDF, 6 per page)
- Optimized Query Execution in Large Search Engines with Global
Page Ordering. With X. Long. International Conference on Very
Large Data Bases (VLDB), September 2003.
PDF
(Talk: PPT
PDF, 6 per page)
- ODISSEA: A Peer-to-Peer Architecture
for Scalable Web Search and Information Retrieval.
With C. Mathur, J. Wu, J. Zhang, A. Delis, M. Kharrazi, X Long, and
K. Shanmugasundaram. 6th International Workshop on the Web and Databases
(WebDB), June 2003. PDF
(Talk: PPT
PDF, 6 per page)
Technical Report (23 pages):
PDF
WWW 2003 Poster Version (2 pages):
PDF
- On the Scalability of an Image Transcoding Proxy Server.
With A. Savant and N. Memon. International Conference on Image
Processing, September 2003.
PDF
- Cluster-Based Delta Compression of Collections of Files.
With Z. Ouyang, N. Memon, and D. Trendafilov. International Conference
on Web Information Systems Engineering (WISE), December 2002.
PDF
(Talk: PPT
PDF, 6 per page)
- I/O-Efficient Techniques for Computing Pagerank.
With Y. Chen and Q. Gan. ACM Conference on Information and Knowledge
Engineering (CIKM), November 2002.
PDF
- zdelta: An Efficient Delta Compression Tool.
With D. Trendafilov and N. Memon. Technical Report TR-CIS-2002-02,
Polytechnic University, June 2002.
PDF
- Algorithms for Delta Compression and Remote File
Synchronization With N. Memon. Invited chapter in
Handbook of Lossless Compression. Edited by K. Sayood,
Academic Press, August 2002. PDF
(preliminary version)
- Design and Implementation of a High-Performance Distributed Web
Crawler.
With V. Shkapenyuk. IEEE International Conference on Data Engineering
(ICDE), February 2002.
Postscript
PDF
(Talk: PPT
PDF 6 per page)
- Compressing the Graph Structure of the Web.
With J. Yuan.
IEEE Data Compression Conference (DCC 2001),
March 2001. Postscript
PDF
- A Unified Approach for Indexed and Non-Indexed Spatial
Joins.
With L. Arge, O. Procopiuc, S. Ramaswamy, J. Vahrenhold, and J. Vitter.
7th International Conference on Extending Database Technology (EDBT 2000),
March 2000, pp. 412-429. Postscript
PDF
- On Rectangular Partitionings of Two-Dimensional Data: Algorithms,
Complexity. and Applications.
With S. Muthukrishnan and V. Poosala.
7th International Conference on Database Theory (ICDT '99), January
1999, pp. 236-256. Postscript
PDF
(Note: an updated
and extended version of the results on p-by-p partitionings appears
in the April 2003 paper listed above)
- Scalable Sweeping-Based Spatial Join.
With L. Arge, O. Procopiuc, S. Ramaswamy, and J. Vitter.
24th International Conference on Very Large Data Bases (VLDB '98),
August 1998, pp. 570-581.
Postscript
PDF
- Optimal Histograms with Quality Guarantees.
With H. Jagadish, N. Koudas, S. Muthukrishnan, V. Poosala, and K. Sevcik.
24th International Conference on Very Large Data Bases (VLDB '98),
August 1998, pp. 275-286.
Postscript
PDF
- Theory and Practice of I/O-Efficient Algorithms for
Multidimensional Batched Searching Problems.
With L. Arge, O. Procopiuc, S. Ramaswamy, and J. Vitter. 9th Symposium
on Discrete Algorithms (SODA '98), January 1998, pp. 685-694.
Postscript
PDF
|
|