Research Blog: 2013

Groundbreaking simulations by Google Exacycle Visiting Faculty

Monday, December 16, 2013

Posted by David Konerding, Staff Software EngineerannouncedGoogle Exacycle for Visiting Facultyenables massive parallelism for doing science in the cloudproposalsKai Kohlhoffsignalling proteinconformational changesbeta-2 adrenergic receptorGPCRNature Chemistrywebsite

Googler Moti Yung elected as 2013 ACM Fellow

Wednesday, December 11, 2013

Posted by Alfred Spector, VP of EngineeringreleasedResearch Scientist Moti YungDr. Moti Yung: Research ScientistFor contributions to cryptography and its use in security and privacy of systemstraitor tracingthreshold cryptosystemszero knowledge proofs.

Free Language Lessons for Computers

Tuesday, December 03, 2013

Posted by Dave Orr, Google Research Product ManagerNot everything that can be counted counts.Not everything that counts can be counted.William Bruce Camerontell storiesvisualize informationmailing list50,000 Lessons on How to Read: a Relation Extraction CorpusWhat is itWikipediaWhere can I find ithttps://code.google.com/p/relation-extraction-corpus/I want to know morehandy blog post11 Billion Clues in 800 Million DocumentsWhat is itFreebase concept IDsWhere can I find itClueWeb09 FACCClueWeb12 FACCI want to know more

Features Extracted From YouTube Videos for Multiview LearningWhat is itWhere can I find itUCI machine learning repository (multiview video dataset)Google’s repositoryI want to know morehere40 Million Entities in ContextWhat is itWhere can I find itWikiLinks corpusUmass Wiki-linksI want to know moreblog post announcing the release

Distributing the Edit History of Wikipedia InfoboxesWhat is itWhere can I find itDownload from GoogleWikimedia DeutschlandI want to know morepostedpaper

Note the change in the capital of Palau.
Syntactic Ngrams over TimeWhat is itGoogle BooksGoogle Ngram ViewerWhere can I find ithttp://commondatastorage.googleapis.com/books/syntactic-ngrams/index.htmlI want to know moreblog postpaper about the release

Dictionaries for linking Text, Entities, and IdeasWhat is itWhere can I find ithttp://nlp.stanford.edu/pubs/crosswikis-data.tar.bz2I want to know moreblog postassociated paperOther datasets

Automatic Freebase annotations of Trec’s Million Query and Web track queries.

A set of Freebase triples that have been deleted from Freebase over time -- 63 million of them.

Released Data Set: Features Extracted From YouTube Videos for Multiview Learning

Tuesday, November 26, 2013

Posted by Omid Madani, Senior Software Engineer“If it looks like a duck, swims like a duck, and quacks like a duck, then it probably is a duck.” - The “duck test”multiple viewsUCI machine learning repository (multiview video dataset)here

The MiniZinc Challenge

Monday, November 25, 2013

Posted by Jon Orwant, Engineering ManagerConstraint Programmingschedulingor-toolsSATinteger programmingwikipedia pageor-toolshere

New Research Challenges in Language Understanding

Friday, November 22, 2013

Posted by Maggie Johnson, Director of Education and University Relationsagenda

Knowledge representation, integration, and maintenance

Efficient and scalable infrastructure and algorithms for inferencing

Presentation and explanation of knowledge

Multilingual computation

Faculty Research Awards program

Unique Strategies for Scaling Teacher Professional Development

Tuesday, November 19, 2013

Posted by Candice Reimers, Senior Program ManagerResearch showsCourse BuilderCreative ComputingNational GeographicAnnenberg LearnerWater: The Essential ResourceThe Friday InstituteDigital Learning Transitionspost-course surveycourse dataDigital Technology coursenew Australian curriculuma suite of coursescourseCommon Core State Standards

Moore’s Law Part 4: Moore's Law in other domains

Friday, November 15, 2013

This is the last entry of a series focused on Moore’s Law and its implications moving forward, edited from a White paper on Moore’s Law, written by Google University Relations Manager Michel Benard. This series quotes major sources about Moore’s Law and explores how they believe Moore’s Law will likely continue over the course of the next several years. We will also explore if there are fields other than digital electronics that either have an emerging Moore's Law situation, or promises for such a Law that would drive their future performance. for Moore’s LawSensors and Data AcquisitionEd Parsons, Google Geospatial TechnologistThe More than Moore discussion can be extended to outside of the main chip, and go within the same board as the main chip or within the device that a user is carrying. Greater sensors capabilities (for the measurement of pressure, electromagnetic field and other local conditions) allow including them in smart phones, glasses, or other devices and perform local data acquisition. This trend is strong, and should allow future devices benefiting from Moore’s Law to receive enough data to perform more complex applications.
Metcalfe’s Law states that the value of a telecommunication network is proportional to the square of connected nodes of the system. This law can be used in parallel to Moore’s Law to evaluate the value of the Internet of Things. The network itself can be seen as composed by layers: at the user’s local level (to capture data related to the body of the user, or to immediately accessible objects), locally around the user (such as to get data within the same street as the user), and finally globally (to get data from the global internet). The extrapolation made earlier in this blog (several TB available in flash memory) will lead to the ability to construct, exchange and download/upload entire contexts for a given situation or a given application and use these contexts without intense network activity, or even with very little or no network activity. Future of Moore’s Law and its impact on PhysicsSverre Jarp, CERNCERN, and its experiments with the Large Electron-Positron Collider (LEP) and Large Hadron Collider (LHC) generate data on the order of a PetaByte per year; this data has to be filtered, processed and analyzed in order to find meaningful physics events leading to new discoveries. In this context Moore’s Law has been particularly helpful to allow computing power, storage and networking capabilities at CERN and at other High Energy Physics (HEP) centers to scale up regularly. Several generations of hardware and software have been exhausted during the journey from mainframes to today’s clusters.
CERN has a long tradition of collaboration with chip manufacturers, hardware and software vendors to understand and predict next trends in the computing evolution curve. Recent analysis indicates that Moore’s Law will likely continue over the next decade. The statement of ‘several TB of flash memory availability by 2025’ may even be a little conservative according to most recent analysis.Big Data VisualizationsKaty Börner, Indiana UniversityThanks to Moore’s Law, the amount of data available for any given phenomenon, whether sensed or simulated, has been growing by several orders of magnitude over the past decades. Intelligent sampling can be used to filter out the most relevant bits of information and is practiced in Physics, Astronomy, Medicine and other sciences. Subsequently, data needs to be analyzed and visualized to identify meaningful trends and phenomena, and to communicate them to others.
While most people learn in school how to read charts and maps, many never learn how to read a network layout—data literacy remains a challenge. The Information Visualization Massive Open Online Course (MOOC) at Indiana University teaches students from more than 100 countries how to read but also how to design meaningful network, topical, geospatial, and temporal visualizations. Using the tools introduced in this free course anyone can analyze, visualize, and navigate complex data sets to understand patterns and trends.Candidate for Moore’s Law in Energy Professor Francesco Stellacci, EPFLIt is currently hard to see a “Moore’s Law” applying to candidates in energy technology. Nuclear fusion could reserve some positive surprises, if several significant breakthroughs are found in the process of creating usable energy with this technique. For any other technology the technological growth will be slower. Best solar cells of today have a 30% efficiency, which could scale higher of course (obviously not much more than a factor of 3). Also cost could be driven down by an order of magnitude. Best estimates show, however, a combined performance improvement by a factor 30 over many years.Further Discussion of Moore’s Law in EnergyRoss Koningstein, Google Director EmeritusAs of today there is no obvious Moore’s Law in the Energy sector which could decrease some major costs by 50% every 18 months. However material properties at nanoscale, and chemical processes such as catalysis are being investigated and could lead to promising results. Applications targeted are hydrocarbon creation at scale and improvement of oil refinery processes, where breakthrough in micro/nano property catalysts is pursued. Hydrocarbons are much more compatible at scale with the existing automotive/aviation and natural gas distribution systems. Here in California, Google Ventures has invested in Cool Planet Energy Systems, a company with neat technology that can convert biomass to gasoline/jet fuel/diesel with impressive efficiency.
One of the challenges is the ability to run many experiments at low cost per experiment, instead of only a few expensive experiments per year. Discoveries are likely to happen faster if more experiments are conducted. This leads to heavier investments, which are difficult to achieve within slim margin businesses. Therefore the nurturing processes for disruptive business are likely to come from new players, beside existing players which will decide to fund significant new investments.Research at Google Google+ page

The first detailed maps of global forest change

Thursday, November 14, 2013

Posted by Matt Hansen and Peter Potapov, University of Maryland; Rebecca Moore and Matt Hancher, Google

Global 30 meter resolution thematic maps of the Earth’s land surface: Landsat composite reference image (2000), summary map of forest loss, extent and gain (2000-2012), individual maps of forest extent, gain, loss, and loss color-coded by year. Click to enlarge

Landsat 7Google Earth Engine

The Chaco woodlands of Bolivia, Paraguay and Argentina are under intensive pressure from agroindustrial development. Paraguay’s Chaco woodlands within the western half of the country are experiencing rapid deforestation in the development of cattle ranches. The result is the highest rate of deforestation in the world. Click to enlarge

http://earthenginepartners.appspot.com/science-2013-global-forest Live-stream Presentation: Mapping Global Forest Change Live online presentation and demonstration, followed by Q&A Monday, November 18, 2013 at 1pm EST, 10am PST Link to live-streamed event: http://goo.gl/JbWWTk Please submit questions here: http://goo.gl/rhxK5XHigh-Resolution Global Maps of 21st-Century Forest Cover Change

Moore’s Law, Part 3: Possible extrapolations over the next 15 years and impact

Wednesday, November 13, 2013

This is the third entry of a series focused on Moore’s Law and its implications moving forward, edited from a White paper on Moore’s Law, written by Google University Relations Manager Michel Benard. This series quotes major sources about Moore’s Law and explores how they believe Moore’s Law will likely continue over the course of the next several years. We will also explore if there are fields other than digital electronics that either have an emerging Moore's Law situation, or promises for such a Law that would drive their future performance.More MooreOverall Roadmap Technology CharacteristicsORTC 2011 tablesDRAM

4Tb Flash multi-level cell (MLC) memory will be in production

There will be ~100 billion transistors per microprocessing unit (MPU)

1TB RAM Memory will cost less than $100

More than MooreResearch at Google Google+ page

Moore’s Law, Part 2: More Moore and More than Moore

Tuesday, November 12, 2013

This is the second entry of a series focused on Moore’s Law and its implications moving forward, edited from a White paper on Moore’s Law, written by Google University Relations Manager Michel Benard. This series quotes major sources about Moore’s Law and explores how they believe Moore’s Law will likely continue over the course of the next several years. We will also explore if there are fields other than digital electronics that either have an emerging Moore's Law situation, or promises for such a Law that would drive their future performance. One of the fundamental lessons derived for the past successes of the semiconductor industry comes for the observation that most of the innovations of the past ten years—those that indeed that have revolutionized the way CMOS transistors are manufactured nowadays—were initiated 10–15 years before they were incorporated into the CMOS process. Strained silicon research began in the early 90s, high-κ/metal-gate initiated in the mid-90s and multiple-gate transistors were pioneered in the late 90s. This fundamental observation generates a simple but fundamental question: “What should the ITRS do to identify now what the extended semiconductor industry will need 10–15 years from now?” International Technology Roadmap for Semiconductors 2012More MooreCMOSvery promising tunnel transistorsstack layers of transistorsBoolean logicquantum computingincreasing the number of states

More than MooreMEMSRF/AMSITRS Overall Roadmap Technology Characteristics (ORTC) 2012Research at Google Google+ page

Moore’s Law, Part 1: Brief history of Moore's Law and current state

Monday, November 11, 2013

This is the first entry of a series focused on Moore’s Law and its implications moving forward, edited from a White paper on Moore’s Law, written by Google University Relations Manager Michel Benard. This series quotes major sources about Moore’s Law and explores how they believe Moore’s Law will likely continue over the course of the next several years. We will also explore if there are fields other than digital electronics that either have an emerging Moore's Law situation, or promises for such a Law that would drive their future performance.
---Moore's Law is the observation that over the history of computing hardware, the number of transistors on integrated circuits doubles approximately every two years. The period often quoted as "18 months" is due to Intel executive David House, who predicted that period for a doubling in chip performance (being a combination of the effect of more transistors and their being faster). WikipediaGordon E. Moore1965 paperpixels in digital camerasOther formulations and similar lawsworld economy

Transistor counts for integrated circuits plotted against their dates of introduction. The curve shows Moore's law - the doubling of transistor counts every two years. The y-axis is logarithmic, so the line corresponds to exponential growth

NTRSITRSsources in 20052010 update to the ITRSMore than MooreSiPSoCCMOSResearch at Google Google+ page

Enhancing Linguistic Search with the Google Books Ngram Viewer

Thursday, October 17, 2013

Posted by Slav Petrov and Dipanjan Das, Research Scientistswhat noun most often follows “Queen” in English fictionhis Atlantic articlethe phrase “changing roles” has recently surged in popularity in English fictionwhen we add non-fiction into the mixcommon capitalizations of “Mother Earth”

Opening up Course Builder data

Wednesday, October 09, 2013

Posted by John Cox and Pavel Simakov, Course Builder Team, Google ResearchCourse Builderwrote a postbuild data processing pipelineslearn from the courses we’ve run

Projecting without a projector: sharing your smartphone content onto an arbitrary display

Thursday, September 26, 2013

Posted by Yang Li, Research Scientist, Google ResearchDeep ShotOpen Project

Broadening Google Patents

Tuesday, September 17, 2013

Posted by Jon Orwant, Engineering ManagerCross-posted with the US Public Policy Blog, the European Public Policy Blog, and Inside Search Blog.Google PatentsPrior Art FinderChinese dual-drive bicycleGerman valve for inflating bicycle tiresCanadian trailer to your bikeWIPO application for pedalling with one legGoogle Translate

We are joining the Open edX platform

Tuesday, September 10, 2013

Posted by Dan Clancy, Director of ResearchCourse BuilderIntroduction to Web AccessibilityedXthe findings of which

Make Your Websites More Accessible to More Users with Introduction to Web Accessibility

Tuesday, September 10, 2013

Eve Andersson, Manager, Accessibility EngineeringCross-posted with Google Developer's BlogEnglandGermanyJapanIntroduction to Web AccessibilityRegistration

A Comparison of Five Google Online Courses

Thursday, September 05, 2013

Posted by Julia Wilkowski, Senior Instructional DesignerObservation #1: Course size

*based on surveys sent only to course completers. Other satisfaction scores represent aggregate survey results sent to all registrants.

Observation #2: Completion rates

Figure 1. Unique page views for Power Searching and Advanced Power Searching

Observation #3: Students have varied goals

52% of registrants intended to complete the course

48% merely wanted to learn a few new things about Google’s mapping tools

78% of students achieved the goal they defined at registration

89% of students learned new features of Google Maps

76% reported learning new features of Google Earth

Observation #4: Continued interest in post-course access

Google Research Awards: Summer 2013

Monday, August 12, 2013

Posted by Maggie Johnson, Director of Education & University RelationsGoogle Research AwardsAndroid-basedGoogle Glassrecipients of this round’s awardsour website

Computer Science Teaching Fellows Starting Up in Charleston, SC

Wednesday, August 07, 2013

Posted by Cameron Fadjo, Program Lead, Computer Science Teaching FellowsSouth Carolina data centerall

Source: 2009-2010 CRA Taulbee Survey (http://www.cra.org/resources/)

NSFCS PrinciplesCSTAstandardsreportComputing in the CoreCode.orgElementary and Secondary School Actsupport CS educationMOOCsmachine learningKhan Academy

Under the hood of Croatian, Filipino, Ukrainian, and Vietnamese in Google Voice Search

Thursday, July 25, 2013

Posted by Eugene Weinstein and Pedro Moreno, Google Speech Teamtonal languagetonemescode switchingneural networkdiscovered cats

11 Billion Clues in 800 Million Documents: A Web Research Corpus Annotated with Freebase Concepts

Wednesday, July 17, 2013

Posted by Dave Orr, Amar Subramanya, Evgeniy Gabrilovich, and Michael Ringgaard, Google Research “I assume that by knowing the truth you mean knowing things as they really are.” - PlatoPlatoKnowledge GraphFreebasedata to help with disambiguation$1.2M in research grantsClueWeb09 FACCClueWeb12 FACCFreebase MID’s

TREC query setsMillion Query TrackWeb TrackWikilinks CorpusClueWeb09 FACCClueWeb12 FACCdata release mailing listSpecial thanks to Jamie Callan and Juan Caicedo Carvajal for their help throughout the annotation project.

Google Research Blog