Google Machine Learning Sentence Compression Algorithms Powers Features Snippets

google machine learning

The other day, I covered at Search Engine Land a Wired article named Google’s hand-fed AI now gives answers, not just search results.

The article explains that Google is now using "sentence compression algorithms" as of this week in the desktop search results. Sentence compression algorithms is Google’s way of extracting the best answer for a query to be displayed in the featured snippets.

Of course, this is not just used for featured snippets but also for Google Home responses, Google Assistant and more. Which is why it is important that Google build a better way to get more answers.

Here is a snippet (using my own sentence compression) to pull out the core nugget from this article:

Deep neutral nets are pattern recognition systems that can learn to perform specific tasks by analyzing vast amounts of data. In this case, theyâve learned to take a long sentence or paragraph from a relevant page on the web and extract the upshotâ"the information youâre looking for.

These âsentence compression algorithmsâ just went live on the desktop incarnation of the search engine. They handle a task thatâs pretty simple for humans but has traditionally been quite difficult for machines. They show how deep learning is advancing the art of natural language understanding, the ability to understand and respond to natural human speech. âYou need to use neural networks – or at least that is the only way we have found to do it,â Google research product manager David Orr says of the companyâs sentence compression work. âWe have to use all of the most advanced technology we have."

To train Googleâs artificial Q&A brain, Orr and company also use old news stories, where machines start to see how headlines serve as short summaries of the longer articles that follow. But for now, the company still needs its team of PhD linguists. They not only demonstrate sentence compression, but actually label parts of speech in ways that help neural nets understand how human language works. Spanning about 100 PhD linguists across the globe, the Pygmalion team produces what Orr calls âthe gold data,â while and the news stories are the âsilver.â The silver data is still useful, because thereâs so much of it. But the gold data is essential. Linne Ha, who oversees Pygmalion, says the team will continue to grow in the years to come.

This kind of human-assisted AI is called âsupervised learning,â and today, itâs just how neural networks operate. Sometimes, companies can crowdsource this workâ"or it just happens organically. People across the internet have already tagged millions of cats in cat photos, for instance, so that makes it easy to train a neural net that recognizes cats. But in other cases, researchers have no choice but to label the data on their own.

I wonder if you guys noticed any changes to the featured snippets that corroborate the Wired story that this went live on desktop search at Google this week?

I asked Glenn Gabe who tracks a nice number of featured snippets and he noticed no significant changes this week with them:

Forum discussion at Twitter.

Source

Twitter to Developers: Public APIs, Gnip Cannot Be Used for Surveillance

Twitter issued a reminder to developers that its public application-programming interfaces and data products are not to be used for surveillance purposes.

Vice president of data strategy Chris Moody issued the reminder in a blog post, writing that while public tweets have successfully been used by first responders in situations such as emergencies and natural disasters:

Recent reports about Twitter data being used for surveillance, however, have caused us great concern. As a company, our commitment to social justice is core to our mission and well established. And our policies in this area are long-standing. Using Twitter’s public APIs or data products to track or profile protesters and activists is absolutely unacceptable and prohibited.

To be clear: We prohibit developers using the public APIs and Gnip data products from allowing law enforcement–or any other entity–to use Twitter data for surveillance purposes. Period. The fact that our public APIs and Gnip data products provide information that people choose to share publicly does not change our policies in this area. And if developers violate our policies, we will take appropriate action, which can include suspension and termination of access to Twitter’s public APIs and data products.

Moody added that Twitter has an internal review process for Gnip data products when new developers come on board, and all or part of requested use cases may be rejected, adding:

Over the coming months, you’ll see us take on expanded enforcement and compliance efforts, including adding more resources for swiftly investigating and acting on complaints about the misuse of Twitter’s public APIs and Gnip data products.

Readers: What are your thoughts on Twitter’s message to developers?

Source

1 18 19 20 21 22 23