Xerox Reveals Breakthrough Software that Categorizes Text and Images at the Same Time

Researchers at Xerox Research Centre Europe (XRCE) have demonstrated a software technology that can link text and general images together, a breakthrough in how online and paper-based information is categorized.

Current tools classify or "tag" either text or images so they can be processed, but until now no one has combined the two effectively, according to computer scientist Marco Bressan, who led the research team. By linking image and text-based content, this new software technology significantly improves fundamental document management tasks like retrieving information from a database or automatically routing documents. The result? More complete searches and streamlined business processes.

For example, if a brochure from an isolated hotel in the French Alps describes the hotel's features and includes maps and pictures of mountainous surroundings, the categorizer will automatically discover the content and link the text and the images together. Then someone searching for an isolated mountain lodge within a certain price range would retrieve the brochure even if "isolated lodge in the mountains" were never mentioned in the actual text.
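
The brochure example can be sketched in a few lines of Python. This is only an illustration of the idea, not Xerox's software: the tag names, the `index_document` and `matches` helpers, and the assumption that each categorizer emits a simple set of tags are all hypothetical.

```python
from typing import Iterable, Set


def index_document(text_tags: Iterable[str], image_tags: Iterable[str]) -> Set[str]:
    """Merge the tags found in the text with the tags found in the images."""
    return set(text_tags) | set(image_tags)


def matches(query_tags: Iterable[str], document_tags: Set[str]) -> bool:
    """A document matches when it carries every tag the query asks for."""
    return set(query_tags) <= document_tags


# Hypothetical tags for the hotel brochure described above.
brochure_text_tags = {"hotel", "isolated", "mid-price"}
brochure_image_tags = {"mountains", "snow"}

hybrid_index = index_document(brochure_text_tags, brochure_image_tags)
query = {"isolated", "mountains"}

found_with_hybrid_index = matches(query, hybrid_index)
found_with_text_only = matches(query, set(brochure_text_tags))
```

Under these assumptions the query succeeds against the combined index but fails against the text alone, because "mountains" appears only in the pictures.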

Marco Bressan, Textual & Visual Pattern Analysis Area Manager, XRCE

The research aligns with Xerox's goal of developing smarter documents to make information-based work easier, more efficient and more effective. Bressan believes there are many uses for the new categorization software.

"Suppose a traveler wants to combine vacation photos with a journal to produce an annotated photo album or photoblog recapping vacation highlights," said Bressan. "Because the Xerox categorizer handles both text and visuals, it can identify the photos, automatically match them to the written text and then enrich the visuals with additional information via hyperlinks to a knowledge base such as Wikipedia."

A second application, according to Bressan, could be at Xerox's imaging centers, where the company scans and digitizes documents to create secure, accessible and searchable online information archives for its customers. Currently the process of scanning, labeling and indexing documents is partially supervised by operators. Hybrid categorization can streamline document management in this application, improving accuracy and eliminating manual operations.

Enabling Xerox's hybrid categorizer are recent advances in machine learning and pattern recognition, advances in computer vision and the large body of hybrid content now available. XRCE has extensive experience with text categorization and, in 2005, demonstrated the industry's first generic image categorizer. The new categorizer combines earlier text and image categorizers to handle hybrid content, with powerful results.
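
The article does not say how the earlier text and image categorizers are combined. One common approach, shown here purely as an assumption, is "late fusion": each categorizer scores the document on its own, and the scores are averaged per category. The function names, the weights and the scores below are invented for illustration.

```python
from typing import Dict


def fuse_scores(
    text_scores: Dict[str, float],
    image_scores: Dict[str, float],
    text_weight: float = 0.5,
) -> Dict[str, float]:
    """Weighted average of per-category scores from two categorizers.

    A category missing from one categorizer is treated as a score of 0.0.
    """
    categories = set(text_scores) | set(image_scores)
    image_weight = 1.0 - text_weight
    return {
        category: text_weight * text_scores.get(category, 0.0)
        + image_weight * image_scores.get(category, 0.0)
        for category in categories
    }


def top_category(scores: Dict[str, float]) -> str:
    """Return the category with the highest fused score."""
    return max(scores, key=scores.get)


# Hypothetical outputs: the text mostly says "hotel", the pictures say "mountain".
text_scores = {"hotel": 0.7, "mountain": 0.1, "beach": 0.2}
image_scores = {"hotel": 0.2, "mountain": 0.7, "beach": 0.1}

fused = fuse_scores(text_scores, image_scores)
best = top_category(fused)
```

With equal weights, "mountain" rises from a minor text category to a close second in the fused scores, which is the kind of mutual enrichment between text and images that the article describes.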

"Xerox's hybrid categorizer creates a shared knowledge space between text and images," said Bressan. "The textual information enriches the visual, and the visual information enriches the textual. The whole is ultimately greater than the sum of the parts."

Listen to the Podcast for more details on this technology.

 