The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - March/April (2009 vol.24)
pp: 8-12
Alon Halevy , Google
Peter Norvig , Google
ABSTRACT
Problems that involve interacting with humans, such as natural language understanding, have not proven to be solvable by concise, neat formulas like F = ma. Instead, the best approach appears to be to embrace the complexity of the domain and address it by harnessing the power of data: if other humans engage in the tasks and generate large amounts of unlabeled, noisy data, new algorithms can be used to build high-quality models from the data.
INDEX TERMS
machine learning, very large data bases, Semantic Web
CITATION
Alon Halevy, Peter Norvig, Fernando Pereira, "The Unreasonable Effectiveness of Data", IEEE Intelligent Systems, vol.24, no. 2, pp. 8-12, March/April 2009, doi:10.1109/MIS.2009.36
REFERENCES
1. E. Wigner, "The Unreasonable Effectiveness of Mathematics in the Natural Sciences," Comm. Pure and Applied Mathematics, vol. 13, no. 1, 1960 pp. 1–14.
2. R. Quirk et al., A Comprehensive Grammar of the English Language, Longman, 1985.
3. H. Kucera, W.N. Francis, and J.B. Carroll, Computational Analysis of Present-Day American English, Brown Univ. Press, 1967.
4. T. Brants and A. Franz, Web 1T 5-Gram Version 1, Linguistic Data Consortium, 2006.
5. S. Riezler, Y. Liu, and A. Vasserman, "Translating Queries into Snippets for Improved Query Expansion," Proc. 22nd Int'l Conf. Computational Linguistics (Coling 08), Assoc. Computational Linguistics, 2008 pp. 737–744.
6. P.P. Talukdar et al., "Learning to Create Data-Integrating Queries," Proc. 34th Int'l Conf. Very Large Databases (VLDB 08), Very Large Database Endowment, 2008 pp. 785–796.
7. J. Hays and A.A. Efros, "Scene Completion Using Millions of Photographs," Comm. ACM, vol. 51, no. 10, 2008 pp. 87–94.
8. L. Getoor and B. Taskar, Introduction to Statistical Relational Learning, MIT Press, 2007.
9. B. Taskar et al., "Max-Margin Parsing," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP 04), Assoc. for Computational Linguistics, 2004 pp. 1–8.
10. S. Schoenmackers, O. Etzioni, and D.S. Weld, "Scaling Textual Inference to the Web," Proc. 2008 Conf. Empirical Methods in Natural Language Processing (EMNLP 08), Assoc. for Computational Linguistics, 2008 pp. 79–88.
11. T. Berners-Lee, J. Hendler, and O. Lassila, "The Semantic Web," Scientific Am.,17 May 2001.
12. P. Friedland et al., "Towards a Quantitative, Platform-Independent Analysis of Knowledge Systems," Proc. Int'l Conf. Principles of Knowledge Representation, AAAI Press, 2004 pp. 507–514.
13. "Interview of Tom Gruber," AIS SIGSEMIS Bull., vol. 1, no. 3, 2004.
14. M.J. Cafarella et al., "WebTables: Exploring the Power of Tables on the Web," Proc. Very Large Data Base Endowment (VLDB 08), ACM Press, 2008 pp. 538–549.
15. M. Paşca, "Organizing and Searching the World Wide Web of Facts. Step Two: Harnessing the Wisdom of the Crowds," Proc. 16th Int'l World Wide Web Conf., ACM Press, 2007 pp. 101–110.
3 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool