Tag: algorithm

Similar Image Search

isk-daemon is an open source database server capable of adding content-based (visual) image searching to any image related website or software. This technology allows users of any image-related website or software to sketch on a widget which image they want to find and have the website reply to them the most similar images or simply request for more similar photos at each image detail page.

similarity search for images.

Book{Un}Suggester

a new feature designed to expose LibraryThing’s excellent and varied recommendations to members and non-members alike. We put them alongside Amazon’s, which are also quite good. We are proud of our recommendations, but haven’t perfected the perfect algorithm yet. When we’ve made things as good as we can, we’re going to start offering recommended book data to libraries. But to heck with that! Let’s talk about bad recommendations. Today we introduce UnSuggester, “the worst recommendation system ever devised™.”

LT continues to introduce innovative features. i love them

Polyline Reduction

might come in handy for overview routes

Encoded Polyline Algorithm

aha, gmaps employs some clever compression for polylines, using offsets

Multi Margin Matrix Factorization

a technique for recommendation engines

Autoregressive integrated moving average

generally considered the best moving average formula

TreeJuxtaposer

Structural comparison of large trees is a difficult task that is only partially supported by current visualization techniques, which are mainly designed for browsing. We present TreeJuxtaposer, a system designed to support the comparison task for large trees of several 100K nodes. We introduce the idea of “guaranteed visibility”, where highlighted areas are treated as landmarks that must remain visually apparent at all times. We propose a new methodology for detailed structural comparison between 2 trees and provide a new nearly-linear algorithm for computing the best corresponding node from one tree to another. In addition, we present a new rectilinear Focus+Context technique for navigation that is well suited to the dynamic linking of side-by-side views while guaranteeing landmark visibility and constant frame rates. These 3 contributions result in a system delivering a fluid exploration experience that scales both in the size of the dataset and the number of pixels in the display. We have based the design decisions for our system on the needs of a target audience of biologists who must understand the structural details of many phylogenetic, or evolutionary, trees. Our tool is also useful in many other application domains where tree comparison is needed, ranging from network management to call graph optimization to genealogy.

Rendering Effective Route Maps

this is a classic paper for routing: exaggerate the details to make them stand out, and compress long stretches.

Route maps, which depict a path from one location to another, have emerged as one of the most popular applications on the Web. Current computer-generated route maps, however, are often very difficult to use. In this paper we present a set of cartographic generalization techniques specifically designed to improve the usability of route maps. Our generalization techniques are based both on cognitive psychology research studying how route maps are used and on an analysis of the generalizations commonly found in handdrawn route maps. We describe algorithmic implementations of these generalization techniques within LineDrive, a real-time system for automatically designing and rendering route maps. Feedback from over 2200 users indicates that almost all believe LineDrive maps are preferable to using standard computer-generated route maps alone.

SIGGRAPH

Proceedings of ACM SIGGRAPH 2006

Future images are going to be far more dynamic objects than today in about the same way as text has gone from something fixed and stable to something that is endlessly recombined and automatically collated, summarized, analyzed and hyperlinked.

2007-08-14: SIGGRAPH 2007 Papers. candy store!
2013-07-25: SIGGRAPH is always fun, and this is no exception.

2013-10-28: Looks shopped. I can tell from the pixels.

2013-11-10: Always amazing.

2014-05-17: The biggest disappointment with SIGGRAPH is how little of it makes it into daily life.

2015-06-29: cool new toys

2016-03-21: way too many explody things is getting old

2017-05-03: Very nice results, from SIGGRAPH of course. Don’t get distracted by the not very good clothing animation.

2018-09-02: Best of SIGGRAPH 2018.

Translation

hmm

GT now gets 55% accuracy on English to Arabic. Human agreement on human translations is 60%. After this point they have no standard by which to measure their progress

2016-09-27: Getting amazingly close to human level performance. it’s interesting that for all languages, the gap between human and perfect translation is much much larger than between human and machine.

Neural Machine Translation: Much better translation quality
Full technical report (23 exciting pages of bedtime reading)

Research blog post

I’m very excited to announce that our new neural machine translation system closes the quality gap between the existing Google Translate production system and human quality translations by 58% to 87% for a variety of different language pairs (see table below, from the technical report we published today). This work has been a close collaboration between the Google Brain team and the Google Translate team.

Thanks to lots of hard engineering work and the computational efficiency of our Tensor Processing Units (see report), we are also rolling these benefits out to users of Google Translate, starting today with Mandarin to English as the first language pair live in production that uses this new system. We’ll be rolling out many more language pairs over the coming weeks.

This highlights the success of neural models at more accurately capturing the complexities of real human language, and is a powerful demonstration of the research our group has been doing on language understanding.

2016-11-15: Nice behind the scenes article on the recent translation breakthrough.

With this update, Google Translate is improving more in a single leap than we’ve seen in the last 10 years combined.

3 overlapping stories converge in Google Translate’s successful metamorphosis to A.I. — a technical story, an institutional story and a story about the evolution of ideas. The technical story is about 1 team on 1 product at 1 company, and the process by which they refined, tested and introduced a brand-new version of an old product in only about a quarter of the time anyone, themselves included, might reasonably have expected. The institutional story is about the employees of a small but influential artificial-intelligence group within that company, and the process by which their intuitive faith in some old, unproven and broadly unpalatable notions about computing upended every other company within a large radius. The story of ideas is about the cognitive scientists, psychologists and wayward engineers who long toiled in obscurity, and the process by which their ostensibly irrational convictions ultimately inspired a paradigm shift in our understanding not only of technology but also, in theory, of consciousness itself.

2023-07-08: Akkadian translation, with modest BLEU scores.

In its transliteration to English test, the AI model scored 37.47. In its cuneiform to English test, it scored 36.52. Both scores were above their target baseline and in the range of a high-quality translation. The model was able to reproduce the nuances of each test sentence’s genre. The AI model works best when it is translating short- to medium-length sentences. It also does better with more formulaic genres, like royal decrees and administrative records, than literary genres such as myths, hymns, and prophecies. With more training on a larger dataset, they aim to improve its accuracy. “100s of 100s of clay tablets inscribed in the cuneiform script document the political, social, economic, and scientific history of ancient Mesopotamia. Yet, most of these documents remain untranslated and inaccessible due to their sheer number and limited quantity of experts able to read them”