Twitter: a good place to detect health conditions VM Prieto, S Matos, M Álvarez, F Cacheda, JL Oliveira PLoS ONE 9 (1), e86191, 2014 | 194 | 2014 |
Extracting lists of data records from semi-structured web pages M Álvarez, A Pan, J Raposo, F Bellas, F Cacheda Data & Knowledge Engineering 64 (2), 491-509, 2008 | 113 | 2008 |
Semi-automatic wrapper generation for commercial web sources A Pan, J Raposo, M Álvarez, J Hidalgo, Á Viña Engineering Information Systems in the Internet Context: IFIP TC8/WG8. 1 …, 2002 | 82 | 2002 |
Crawling the content hidden behind web forms M Álvarez, J Raposo, A Pan, F Cacheda, F Bellas, V Carneiro International Conference on Computational Science and Its Applications, 322-333, 2007 | 63 | 2007 |
DeepBot: a focused crawler for accessing hidden web content M Álvarez, J Raposo, A Pan, F Cacheda, F Bellas, V Carneiro Proceedings of the 3rd international workshop on Data enginering issues in E …, 2007 | 57 | 2007 |
The wargo system: Semi-automatic wrapper generation in presence of complex data access modes J Raposo, A Pan, M Álvarez, J Hidalgo, Á Viña Database and Expert Systems Applications, 2002. Proceedings. 13th …, 2002 | 50 | 2002 |
Automatically maintaining wrappers for semi-structured web sources J Raposo, A Pan, M Álvarez, J Hidalgo Data & Knowledge Engineering 61 (2), 331-358, 2007 | 46 | 2007 |
The denodo data integration platform A Pan, J Raposo, M Álvarez, P Montoto, V Orjales, J Hidalgo, L Ardao, ... VLDB'02: Proceedings of the 28th International Conference on Very Large …, 2002 | 44 | 2002 |
SAAD, a content based Web Spam Analyzer and Detector VM Prieto, M Álvarez, F Cacheda Journal of Systems and Software 86 (11), 2906-2918, 2013 | 37 | 2013 |
Finding and extracting data records from web pages M Álvarez, A Pan, J Raposo, F Bellas, F Cacheda Journal of Signal Processing Systems 59, 123-137, 2010 | 36 | 2010 |
Detecting LinkedIn spammers and its spam nets VM Prieto, M Álvarez, F Cacheda International Journal of Advanced Computer Science and Applications (IJACSA …, 2013 | 24 | 2013 |
Client-side deep web data extraction M Álvarez, A Pan, J Raposo, Á Viña IEEE International Conference on E-Commerce Technology for Dynamic E …, 2004 | 24 | 2004 |
A Model for Advanced Query Capability Description in Mediator Systems. A Pan, P Montoto, A Molano, M Álvarez, J Raposo, Á Viña ICEIS, 140-147, 2002 | 24 | 2002 |
Analysis and detection of web spam by means of web content VM Prieto, M Álvarez, R López-García, F Cacheda Multidisciplinary Information Retrieval: 5th Information Retrieval Facility …, 2012 | 23 | 2012 |
A Task-specific Approach for Crawling the Deep Web. M Álvarez, J Raposo, F Cacheda, A Pan Engineering Letters 13 (3), 2006 | 20 | 2006 |
Using clustering and edit distance techniques for automatic web data extraction M Álvarez, A Pan, J Raposo, F Bellas, F Cacheda Web Information Systems Engineering–WISE 2007, 212-224, 2007 | 19 | 2007 |
Crawling web pages with support for client-side dynamism M Álvarez, A Pan, J Raposo, J Hidalgo International Conference on Web-Age Information Management, 252-262, 2006 | 19 | 2006 |
Automatic wrapper maintenance for semi-structured web sources using results from previous queries J Raposo, A Pan, M Álvarez, Á Viña Proceedings of the 2005 ACM symposium on Applied computing, 654-659, 2005 | 18 | 2005 |
A Workflow Language for Web Automation. P Montoto, A Pan, J Raposo, J Losada, F Bellas, V Carneiro J. Univers. Comput. Sci. 14 (11), 1838-1856, 2008 | 16 | 2008 |