Marc Najork
Google DeepMind
Title
Cited by
Year
Detecting spam web pages through content analysis
A Ntoulas, M Najork, M Manasse, D Fetterly
Proceedings of the 15th international conference on World Wide Web, 83-92, 2006
Cited by: 916; Year: 2006
Mercator: A scalable, extensible web crawler
A Heydon, M Najork
World Wide Web 2 (4), 219-229, 1999
Cited by: 885; Year: 1999
A large-scale study of the evolution of web pages
D Fetterly, M Manasse, M Najork, J Wiener
Proceedings of the 12th international conference on World Wide Web, 669-678, 2003
Cited by: 838; Year: 2003
Breadth-first crawling yields high-quality pages
M Najork, JL Wiener
Proceedings of the 10th international conference on World Wide Web, 114-118, 2001
Cited by: 638; Year: 2001
Web crawling
C Olston, M Najork
Foundations and Trends® in Information Retrieval 4 (3), 175-246, 2010
Cited by: 637; Year: 2010
Spam, damn spam, and statistics: Using statistical analysis to locate spam web pages
D Fetterly, M Manasse, M Najork
Proceedings of the 7th International Workshop on the Web and Databases …, 2004
Cited by: 482; Year: 2004
On near-uniform URL sampling
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 33 (1-6), 295-308, 2000
Cited by: 348; Year: 2000
Position Bias Estimation for Unbiased Learning to Rank in Personal Search
X Wang, N Golbandi, M Bendersky, D Metzler, M Najork
11th ACM International Conference on Web Search and Data Mining, 2018
Cited by: 305; Year: 2018
Learning to rank with selection bias in personal search
X Wang, M Bendersky, D Metzler, M Najork
39th International ACM SIGIR Conference on Research and Development in …, 2016
Cited by: 300; Year: 2016
WIT: Wikipedia-based image text dataset for multimodal multilingual machine learning
K Srinivasan, K Raman, J Chen, M Bendersky, M Najork
44th International ACM SIGIR Conference on Research and Development in …, 2021
Cited by: 287; Year: 2021
Boxwood: Abstractions as the Foundation for Storage Infrastructure
J MacCormick, N Murphy, M Najork, CA Thekkath, L Zhou
OSDI 4, 8-8, 2004
Cited by: 279; Year: 2004
Automatically Creating Training Data For Language Identifiers
M Goldszmit, M Najork, S Paparizos
US Patent App. 13/943,788, 2015
Cited by: 248; Year: 2015
On the evolution of clusters of near-duplicate web pages
D Fetterly, M Manasse, M Najork
Proceedings of the 1st Latin American Web Congress, 37-45, 2003
Cited by: 224; Year: 2003
High-performance web crawling
M Najork, A Heydon
Handbook of massive data sets, 25-45, 2002
Cited by: 223*; Year: 2002
Social network recommended content and recommending members for personalized search results
T Harrington, R Shenoy, M Najork, R Panigrahy
US Patent App. 13/252,215, 2013
Cited by: 212; Year: 2013
Measuring index quality using random walks on the Web
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 31 (11-16), 1291-1303, 1999
Cited by: 210; Year: 1999
Detecting phrase-level duplication on the world wide web
D Fetterly, M Manasse, M Najork
Proceedings of the 28th annual international ACM SIGIR conference on …, 2005
Cited by: 193; Year: 2005
System and method for associating an extensible set of data with documents downloaded by a web crawler
MA Najork, CA Heydon
US Patent 6,351,755, 2002
Cited by: 184; Year: 2002
A sketch-based distance oracle for web-scale graphs
A Das Sarma, S Gollapudi, M Najork, R Panigrahy
Proceedings of the third ACM international conference on Web search and data …, 2010
Cited by: 176; Year: 2010
The LambdaLoss Framework for Ranking Metric Optimization
X Wang, C Li, N Golbandi, M Bendersky, M Najork
27th ACM International Conference on Information and Knowledge Management …, 2018
Cited by: 172; Year: 2018