Panagiotis G. Ipeirotis

Last Modified: Sunday, 07-Jun-2009 13:51:13 EDT

Latest version always available at http://pages.stern.nyu.edu/~panos/cv.html


Information Systems Group
Department of Information, Operations, and Management Sciences
Leonard N. Stern School of Business
New York University
44 West 4th Street, Suite 8-84
New York, NY 10012-1126, USA
 

Phone: +1 (212) 998-0803
Fax: +1 (212) 995-4228
panos@stern.nyu.edu
http://www.stern.nyu.edu/~panos


Education

Research Interests

Professional Employment

Honors and Awards

Teaching Experience

Academic Activities

Invited Talks

Work in Progress

  1. The Dimensions of Reputation in Electronic Markets,
    A. Ghose, P. Ipeirotis, and A. Sundararajan,
    (under review, Management Information Systems Quarterly)
  2. Deriving the Pricing Power of Product Features by Mining Consumer Reviews,
    N. Archak, A. Ghose, and P. Ipeirotis
    (under 2nd round of review, Management Science)
    Journal version of KDD 2007 paper
  3. Estimating the Socio-Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics,
    A. Ghose and P. Ipeirotis,
    (under 2nd round of review, IEEE Transactions on Knowledge and Data Engineering (TKDE))
    Journal version of ICEC 2007, WITS 2006 papers
  4. Answering General Time Sensitive Queries,
    W. Dakka, L. Gravano, and P. Ipeirotis,
    (under review, IEEE Transactions on Knowledge and Data Engineering (TKDE))
    Journal version of CIKM 2008 paper
  5. Improving Data Quality and Data Mining Using Multiple, Noisy Labelers
    V. Sheng, F. Provost, and P. Ipeirotis
    (to be submitted)
    Journal version of KDD 2008 paper
  6. Modeling Dependencies in Prediction Markets,
    N. Archak and P. Ipeirotis
    (to be submitted)

Papers in Refereed Journals

  1. A Quality-Aware Optimizer for Information Extraction,
    A. Jain and P. Ipeirotis,
    ACM Transactions on Database Systems (TODS), April 2009
  2. Classification-Aware Hidden-Web Text Database Selection,
    P. Ipeirotis and L. Gravano,
    ACM Transactions on Information Systems (TOIS), April 2008
  3. Towards a Query Optimizer for Text-Centric Tasks,
    P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano,
    ACM Transactions on Database Systems (TODS), vol. 32, no. 4, December 2007
  4. Modeling and Managing Content Changes in Text Databases,
    P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano,
    ACM Transactions on Database Systems (TODS), vol. 32, no. 3, September 2007
  5. Duplicate Record Detection: A Survey,
    A. Elmagarmid, P. Ipeirotis, and V. Verykios,
    IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 19, no. 1, January 2007
  6. QProber: A System for Automatic Classification of Hidden-Web Databases,
    L. Gravano, P. Ipeirotis, and M. Sahami,
    ACM Transactions on Information Systems (TOIS), vol. 21, no. 1, January 2003

Papers in Refereed Conferences

  1. Modeling Volatility in Prediction Markets,
    N. Archak and P. Ipeirotis
    Proceedings of the 10th ACM Conference on Electronic Commerce (EC'09), 2009 (40/158 = 25% accepted)
  2. Query by Document,
    Y. Yang, N. Bansal, W. Dakka, P. Ipeirotis, N. Koudas, and D. Papadias
    Second ACM International Conference on Web Search and Data Mining (WSDM 2009), 2009 (29/170 = 17% accepted)
  3. Join Optimization of Information Extraction Output: Quality Matters!,
    A. Jain, P. Ipeirotis, A. Doan, and L. Gravano
    Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE 2009), 2009
  4. Get Another Label? Improving Data Quality and Data Mining Using Multiple, Noisy Labelers (Best Paper Award Runner-Up),
    V. Sheng, F. Provost, and P. Ipeirotis
    Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), 2008 (50/~500 < 10% accepted)
  5. Automatic Extraction of Useful Facet Hierarchies from Text Databases,
    W. Dakka and P. Ipeirotis
    Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE 2008), 2008
  6. Show me the Money! Deriving the Pricing Power of Product Features by Mining Consumer Reviews,
    N. Archak, A. Ghose, and P. Ipeirotis
    Proceedings of the Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2007), 2007 (~100/513 < 20% accepted)
  7. Opinion Mining Using Econometrics: A Case Study on Reputation Systems,
    A. Ghose, P. Ipeirotis, and A. Sundararajan
    Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), 2007 (132/588 = 22% accepted)
  8. To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks (Best Paper Award),
    P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano,
    Proceedings of the 2006 ACM International Conference on Management of Data (SIGMOD 2006), 2006 (58/446 = 13% accepted)
    (the work also appears in the ACM TODS paper "Towards a Query Optimizer for Text-Centric Tasks")
  9. Automatic Construction of Multifaceted Browsing Interfaces,
    W. Dakka, P. Ipeirotis, and K. Wood,
    Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, 2005 (76/425 = 18% accepted)
  10. Modeling and Managing Content Changes in Text Databases (Best Paper Award),
    P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano,
    Proceedings of the 21st IEEE International Conference on Data Engineering (ICDE 2005), 2005 (67/521 = 13% accepted)
    (the work also appears in the ACM TODS paper "Modeling and Managing Content Changes in Text Databases")
  11. When one Sample is not Enough: Improving Text Database Selection Using Shrinkage,
    P. Ipeirotis and L. Gravano,
    Proceedings of the 2004 ACM International Conference on Management of Data (SIGMOD 2004), 2004 (69/431 = 16% accepted)
    (the work also appears in the ACM TOIS paper "Classification-Aware Hidden-Web Text Database Selection")
  12. Text Joins in an RDBMS for Web Data Integration,
    L. Gravano, P. Ipeirotis, N. Koudas, and D. Srivastava,
    Proceedings of the 12th International World-Wide Web Conference (WWW2003), 2003 (13% accepted, more than 600 submissions)
  13. Text Joins for Data Cleansing and Integration in an RDBMS,
    L. Gravano, P. Ipeirotis, N. Koudas, and D. Srivastava,
    Proceedings of the 19th IEEE International Conference on Data Engineering (ICDE 2003), 2003
  14. Distributed Search over the Hidden-Web: Hierarchical Database Sampling and Selection,
    P. Ipeirotis and L. Gravano,
    Proceedings of the 28th International Conference on Very Large Databases (VLDB 2002), 2002 (69/432 = 16% accepted)
    (the work also appears in the ACM TOIS paper "Classification-Aware Hidden-Web Text Database Selection")
  15. Extending SDARTS: Extracting Metadata from Web Databases and Interfacing with the Open Archives Initiative,
    P. Ipeirotis, T. Barry, and L. Gravano,
    Proceedings of the Second ACM+IEEE Joint Conference on Digital Libraries (JCDL 2002), 2002 (33% accepted)
  16. Approximate String Joins in a Database (Almost) for Free,
    L. Gravano, P. Ipeirotis, H.V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava,
    Proceedings of the 27th International Conference on Very Large Databases (VLDB 2001), 2001 (59/339 = 17% accepted)
  17. Probe, Count, and Classify: Categorizing Hidden-Web Databases,
    P. Ipeirotis, L. Gravano, and M. Sahami,
    Proceedings of the 2001 ACM International Conference on Management of Data (SIGMOD 2001), 2001 (15% accepted)
    (the work also appears in the ACM TOIS paper "QProber: A System for Automatic Classification of Hidden-Web Databases")
  18. SDLIP + STARTS = SDARTS. A Protocol and Toolkit for Metasearching,
    N. Green, P. Ipeirotis, and L. Gravano,
    Proceedings of the First ACM+IEEE Joint Conference on Digital Libraries (JCDL 2001), 2001

Book Chapters

  1. Taxonomy Design for Faceted Search
    W. Dakka and P. Ipeirotis
    Dynamic Taxonomies and Faceted Search: Theory, Practice, and Experience, Springer, 2009
  2. Searching Digital Libraries,
    P. Ipeirotis,
    Encyclopedia of Database Systems, Springer, 2008

Papers in Refereed Workshops and Demonstration Sessions

  1. Answering General Time Sensitive Queries,
    W. Dakka, L. Gravano, and P. Ipeirotis,
    Proceedings of the 2008 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2008), 2008
  2. Stay Elsewhere? Improving Local Search for Hotels Using Econometric Modeling and Image Classification,
    B. Li, A. Ghose, and P. Ipeirotis
    Proceedings of the Sixth International Workshop on the Web and Databases (WebDB 2008), 2008 (14/30 = 47% accepted)
  3. The Impact of Information Disclosure on Stock Market Returns: The Sarbanes-Oxley Act and the Role of Media as an Information Intermediary,
    K. Balakrishnan, A. Ghose, and P. Ipeirotis,
    Proceedings of the Seventh Workshop on the Economics of Information Security (WEIS 2008), 2008
  4. Multifaceted Browsing over Large Databases of Text-Annotated Objects,
    W. Dakka, P. Ipeirotis, and K. Wood,
    Proceedings of the 23rd IEEE International Conference on Data Engineering, Demonstrations (ICDE 2007), 2007 (28/73 = 38% accepted)
  5. Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality
    A. Ghose and P. Ipeirotis,
    Proceedings of the 2006 Workshop on Information Technology and Systems (WITS 2006), 2006 (36/105 = 34% accepted)
  6. Automatic Discovery of Useful Facet Terms,
    W. Dakka, R. Dayal, and P. Ipeirotis
    Proceedings of the ACM SIGIR 2006 Workshop on Faceted Search, 2006
  7. Reputation Premiums in Electronic Peer-to-Peer Markets: Analyzing Textual Feedback and Network Structure,
    A. Ghose, P. Ipeirotis, and A. Sundararajan
    ACM SIGCOMM 2005 Workshop Proceedings, Third Workshop on Economics of Peer-to-Peer Systems (P2PEcon 2005), 2005
  8. Modeling Query-Based Access to Text Databases,
    E. Agichtein, P. Ipeirotis, and L. Gravano,
    Proceedings of the Sixth ACM SIGMOD International Workshop on the Web and Databases (WebDB 2003), 2003 (25% accepted)
  9. PERSIVAL Demo: Categorizing Hidden-Web Resources,
    P. Ipeirotis, L. Gravano, and M. Sahami,
    Proceedings of the First ACM+IEEE Joint Conference on Digital Libraries (JCDL 2001), 2001
  10. Automatic Classification of Text Databases through Query Probing,
    P. Ipeirotis, L. Gravano, and M. Sahami,
    Proceedings of the Third International Workshop on the Web and Databases (WebDB 2000), 2000 (29% accepted)
    (the work also appears in the ACM TOIS paper "QProber: A System for Automatic Classification of Hidden-Web Databases" and in the ACM SIGMOD 2001 conference paper "Probe, Count, and Classify: Categorizing Hidden-Web Databases")

Invited Papers

  1. The EconoMining Project at NYU: Studying the Economic Value of User-Generated Content on the Internet,
    A. Ghose and P. Ipeirotis
    Journal of Revenue and Pricing Management, 2009
  2. Building Query Optimizers for Information Extraction: The SQoUT Project,
    A. Jain, P. Ipeirotis, and L. Gravano,
    SIGMOD Record, Special Issue on "Managing Information Extraction," vol. 37, no. 4, December 2008
  3. Designing Novel Review Ranking Systems: Predicting Usefulness and Impact of Reviews,
    A. Ghose and P. Ipeirotis,
    Proceedings of the Ninth International Conference on Electronic Commerce (ICEC 2007), 2007
  4. Query- vs. Crawling-based Classification of Searchable Web Databases,
    L. Gravano, P. Ipeirotis, and M. Sahami,
    IEEE Data Engineering Bulletin, vol. 25, no. 1, March 2002
  5. Using q-grams in a DBMS for Approximate String Processing,
    L. Gravano, P. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, L. Pietarinen, and D. Srivastava,
    IEEE Data Engineering Bulletin, vol. 24, no. 4, December 2001

Miscellaneous Publications

  1. Multi-labeling When Data Preprocessing Is Costly,
    V. Sheng, F. Provost, and P. Ipeirotis
    INFORMS Annual Meeting, 2008
  2. Improving Data Quality and Data Mining Using Multiple, Noisy Labelers,
    V. Sheng, F. Provost, and P. Ipeirotis
    3rd Annual Machine Learning Symposium, 2008
  3. Detecting Important Events Using Prediction Markets, Text Mining, and Volatility Modeling,
    G. Tziralis and P. Ipeirotis
    Third Workshop on Prediction Markets, 2008
  4. Noisy Multi-Labeling for Data Mining
    V. Sheng, F. Provost, and P. Ipeirotis
    Fourth Research Symposium on Statistical Challenges in E-Commerce, 2008
  5. Measuring the Pricing Power of User-Generated Reviews for Hedonic Goods
    N. Archak, A. Ghose, and P. Ipeirotis,
    Fourth Research Symposium on Statistical Challenges in E-Commerce, 2008
  6. Stay Elsewhere? The Economic Impact of Location-based Hotel Features: A View from Remote Sensing Image Analysis
    B. Li, A. Ghose, and P. Ipeirotis
    Winter Conference on Business Intelligence, 2008
  7. Designing Novel Review Ranking Systems on the Web: Combining Economics with Opinion Mining
    A. Ghose and P. Ipeirotis
    Third Research Symposium on Statistical Challenges in E-Commerce, 2007
  8. Towards Automating the Pricing Power of Product Attributes: An Analysis of Online Product Reviews
    N. Archak, A. Ghose, and P. Ipeirotis,
    Winter Conference on Business Intelligence, 2007
  9. Designing Ranking Systems for Consumer Reviews: The Economic Impact of Customer Sentiment in Electronic Markets
    A. Ghose and P. Ipeirotis,
    Proceedings of the International Conference on Decision Support Systems (ICDSS 2007), 2007
  10. The Dimensions of Reputation in Electronic Markets,
    A. Ghose, P. Ipeirotis, and A. Sundararajan
    Second Research Symposium on Statistical Challenges in E-Commerce, 2006

Society Membership

University Service

Other Educational Activities