What is the best quality OCR available?

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

I am not sure OCR will work at all on images of 96 DPI. 100% accuracy
will probably be obtained if you work on a scanned image at 300 dpi of
an original which is a first generation laser print out.

Ravi

Peter Olcott

2006-09-14 17:18:19 UTC

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the sample
from the link below, and it completely failed to match any characters. ABBYY
FineReader 8.0 Professional Edition provided the same results when their techs
tested their software on this same sample.

http://www.seescreen.com/Unique.html

My best current estimate is the SeeScreen is unique in the world in its ability
to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

--
Patented SeeScreen enables
programs to see anything on
the computer display screen
www.SeeScreen.com

Claudio Grondi

2006-09-14 20:03:07 UTC

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the sample
from the link below, and it completely failed to match any characters. ABBYY
FineReader 8.0 Professional Edition provided the same results when their techs
tested their software on this same sample.
http://www.seescreen.com/Unique.html
My best current estimate is the SeeScreen is unique in the world in its ability
to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

This appears trivial to me as the shape of screen fonts is 100%
predictable, so no OCR in the usual sense is required covering the huge
variety of possible patterns - for me the reason for the 100% accuracy.

May someone explain here what the patent behind SeeScreen is actually
about?
I have trouble to get the core of the message from the published patent
text and the related pictures don't show up in my Internet browser:

http://seescreen.com/
http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&l=50&s1=7046848.PN.&OS=PN/7046848&RS=PN/7046848
http://patimg1.uspto.gov/.piw?docid=US007046848&PageNum=12&IDKey=76E347862AFD&HomeUrl=http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1%2526Sect2=HITOFF%2526d=PALL%2526p=1%2526u=%25252Fnetahtml%25252FPTO%25252Fsrchnum.htm%2526r=1%2526f=G%2526l=50%2526s1=7046848.PN.%2526OS=PN/7046848%2526RS=PN/7046848

Claudio Grondi

Peter Olcott

2006-09-14 20:37:26 UTC

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the
sample from the link below, and it completely failed to match any characters.
ABBYY FineReader 8.0 Professional Edition provided the same results when
their techs tested their software on this same sample.
http://www.seescreen.com/Unique.html
My best current estimate is the SeeScreen is unique in the world in its
ability to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

Simultaneously processing this huge set of patterns was the tricky part.

May someone explain here what the patent behind SeeScreen is actually about?
I have trouble to get the core of the message from the published patent text

This technology provides the means to simultaneously process millions of
arbitrary graphical objects for recognition. The two key benefits are:
(1) The accuracy rate for text recognition is much higher than any alternative
method.
(2) GUI scripting of every application is now possible.
This latter benefit is derived from the fact that the mouse can now always "see"
where it needs to click.

http://seescreen.com/
http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&l=50&s1=7046848.PN.&OS=PN/7046848&RS=PN/7046848
http://patimg1.uspto.gov/.piw?docid=US007046848&PageNum=12&IDKey=76E347862AFD&HomeUrl=http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1%2526Sect2=HITOFF%2526d=PALL%2526p=1%2526u=%25252Fnetahtml%25252FPTO%25252Fsrchnum.htm%2526r=1%2526f=G%2526l=50%2526s1=7046848.PN.%2526OS=PN/7046848%2526RS=PN/7046848
Claudio Grondi

Claudio Grondi

2006-09-15 06:26:59 UTC

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the
sample from the link below, and it completely failed to match any characters.
ABBYY FineReader 8.0 Professional Edition provided the same results when
their techs tested their software on this same sample.
http://www.seescreen.com/Unique.html
My best current estimate is the SeeScreen is unique in the world in its
ability to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

Simultaneously processing this huge set of patterns was the tricky part.

May someone explain here what the patent behind SeeScreen is actually about?
I have trouble to get the core of the message from the published patent text

This technology provides the means to simultaneously process millions of
(1) The accuracy rate for text recognition is much higher than any alternative
method.
(2) GUI scripting of every application is now possible.
This latter benefit is derived from the fact that the mouse can now always "see"
where it needs to click.

This claim (2) is in my eyes nothing more than a huge balloon of hot air
as there is much more to 'GUI scripting of every application' than
recognition of graphical objects and texts. If it were actually true we
would have any AI problems already solved by it ...
To come to the point:
SeeScreen appears to me more as a test if the features described will
find a market as a matter of fact of existence of a technology capable
of doing what it pretends to be capable of.
Perfect reading of texts from screen is already a long time successfully
on the market with the right mouse click using Babylon translator
software displaying the text of the word under the mouse pointer along
with its translation.

Claudio Grondi

Peter Olcott

2006-09-15 14:32:51 UTC

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the
sample from the link below, and it completely failed to match any
characters. ABBYY FineReader 8.0 Professional Edition provided the same
results when their techs tested their software on this same sample.
http://www.seescreen.com/Unique.html
My best current estimate is the SeeScreen is unique in the world in its
ability to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

Simultaneously processing this huge set of patterns was the tricky part.

May someone explain here what the patent behind SeeScreen is actually about?
I have trouble to get the core of the message from the published patent text

This technology provides the means to simultaneously process millions of
(1) The accuracy rate for text recognition is much higher than any
alternative method.
(2) GUI scripting of every application is now possible.
This latter benefit is derived from the fact that the mouse can now always
"see" where it needs to click.

This is true, another review by another expert pointed out some of these
difficulties. The key problem that this technology solves is providing the means
for the GUI scripting language to always know where it needs to click the mouse.
This now permits applications requiring mouse input to be automated. Another key
problem that this technology solves better than ever before, is providing a
means of data interchange between applications where none existed previously.

SeeScreen appears to me more as a test if the features described will find a
market as a matter of fact of existence of a technology capable of doing what
it pretends to be capable of.
Perfect reading of texts from screen is already a long time successfully on
the market with the right mouse click using Babylon translator software
displaying the text of the word under the mouse pointer along with its
translation.
Claudio Grondi

If a method using all of the elements of the claim below already exists, I need
to know this. SeeScreen technology does not translate from one language to
another. Also there are methods for obtaining text from the computer display
screen that do not involve processing pixels. These methods (when they work at
all) always produce 100% accuracy. If there are any other methods that involve
processing display screen pixels that can achieve 100% accuracy with font size
as small as 6 points at 96 DPI display screen resolutions, I need to know this.

1. A method for constructing a deterministic finite automaton (DFA) for
recognizing machine generated character glyphs in a graphic image, comprising:
collecting character glyphs of a font instance, including all combinations of
overlapping glyphs; merging the identical prefixes of these character glyphs
together; generating the DFA from the merged prefixes; and storing the DFA in a
sparse matrix representation.

The term "Deterministic Finite Automaton (DFA)" is taken to mean any automated
system whereby a next state is entirely determined by a current input and a
current state.

The term "Sparse Matrix" is taken to have the common meaning of the term
"Sparse" combined with the common computer science meaning of the term "Matrix",
a two dimensional array of elements. Thus a "Sparse Matrix" is to include every
means of storing any two dimensional array of elements such that the number of
elements stored is significantly less than the number of columns times the
number of rows in the matrix. This includes, but, is not limited to storing a
two dimensional array as a single dimensional array.

Claudio Grondi

2006-09-15 15:25:43 UTC

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the
sample from the link below, and it completely failed to match any
characters. ABBYY FineReader 8.0 Professional Edition provided the same
results when their techs tested their software on this same sample.
http://www.seescreen.com/Unique.html
My best current estimate is the SeeScreen is unique in the world in its
ability to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

Simultaneously processing this huge set of patterns was the tricky part.

May someone explain here what the patent behind SeeScreen is actually about?
I have trouble to get the core of the message from the published patent text

This technology provides the means to simultaneously process millions of
(1) The accuracy rate for text recognition is much higher than any
alternative method.
(2) GUI scripting of every application is now possible.
This latter benefit is derived from the fact that the mouse can now always
"see" where it needs to click.

If a method using all of the elements of the claim below already exists, I need
to know this. SeeScreen technology does not translate from one language to
another. Also there are methods for obtaining text from the computer display
screen that do not involve processing pixels. These methods (when they work at
all) always produce 100% accuracy. If there are any other methods that involve
processing display screen pixels that can achieve 100% accuracy with font size
as small as 6 points at 96 DPI display screen resolutions, I need to know this.
1. A method for constructing a deterministic finite automaton (DFA) for
collecting character glyphs of a font instance, including all combinations of
overlapping glyphs; merging the identical prefixes of these character glyphs
together; generating the DFA from the merged prefixes; and storing the DFA in a
sparse matrix representation.
The term "Deterministic Finite Automaton (DFA)" is taken to mean any automated
system whereby a next state is entirely determined by a current input and a
current state.
The term "Sparse Matrix" is taken to have the common meaning of the term
"Sparse" combined with the common computer science meaning of the term "Matrix",
a two dimensional array of elements. Thus a "Sparse Matrix" is to include every
means of storing any two dimensional array of elements such that the number of
elements stored is significantly less than the number of columns times the
number of rows in the matrix. This includes, but, is not limited to storing a
two dimensional array as a single dimensional array.

Now I am exactly at the point at which I have tried to get some hints
about how and what the method described in the patent does - up to now
in vain.
Sorry, I don't understand the claims - maybe because the language used
sounds alien to me being not able to grasp what DFA or a 'Sparse Matrix'
is for in the given context - (stupid) me understands it like:
we take _one_ CPU and process the entire graphical information of a
screenshot in parallel (on multiple CPUs? or what does it mean in
parallel) what does speed up the process of recognition of screen
elements letting all other already known methods miles behind.
I would be glad to hear here what exactly is, what has not been done
before and is covered by the patent - best from experts in the area of
OCR which seem to stay away from this thread yet.

From reading the comp.compression I know how many US patents exist on
methods without any sense and practical use granted often because of
lack of expertize to reject them or knowledge about prior art.
So the only way to see oneself if a patented method makes sense is to
understand it - because it is not enough to know that a patent was
granted to assume it is real - as it can happen that the only purpose of
a patent is to get fundings for further research.

An online demonstration of the technology would sure help here as anyone
could see with own eyes if it works and how long does it take to get a
response after a screenshot was submitted.

Claudio Grondi

Peter Olcott

2006-09-15 15:36:27 UTC

Post by Claudio Grondi

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

That was my best estimate too. I had Nuance techs try OmniPage 15 on the
sample from the link below, and it completely failed to match any
characters. ABBYY FineReader 8.0 Professional Edition provided the same
results when their techs tested their software on this same sample.
http://www.seescreen.com/Unique.html
My best current estimate is the SeeScreen is unique in the world in its
ability to recognize
characters as small as 6 points at 96 DPI screen resolutions with 100% accuracy.

Simultaneously processing this huge set of patterns was the tricky part.

Post by Claudio Grondi
May someone explain here what the patent behind SeeScreen is actually about?
I have trouble to get the core of the message from the published patent

This technology provides the means to simultaneously process millions of
(1) The accuracy rate for text recognition is much higher than any
alternative method.
(2) GUI scripting of every application is now possible.
This latter benefit is derived from the fact that the mouse can now always
"see" where it needs to click.

This is true, another review by another expert pointed out some of these
difficulties. The key problem that this technology solves is providing the
means for the GUI scripting language to always know where it needs to click
the mouse. This now permits applications requiring mouse input to be
automated. Another key problem that this technology solves better than ever
before, is providing a means of data interchange between applications where
none existed previously.

If a method using all of the elements of the claim below already exists, I
need to know this. SeeScreen technology does not translate from one language
to another. Also there are methods for obtaining text from the computer
display screen that do not involve processing pixels. These methods (when
they work at all) always produce 100% accuracy. If there are any other
methods that involve processing display screen pixels that can achieve 100%
accuracy with font size as small as 6 points at 96 DPI display screen
resolutions, I need to know this.
1. A method for constructing a deterministic finite automaton (DFA) for
recognizing machine generated character glyphs in a graphic image,
comprising: collecting character glyphs of a font instance, including all
combinations of overlapping glyphs; merging the identical prefixes of these
character glyphs together; generating the DFA from the merged prefixes; and
storing the DFA in a sparse matrix representation.
The term "Deterministic Finite Automaton (DFA)" is taken to mean any
automated system whereby a next state is entirely determined by a current
input and a current state.
The term "Sparse Matrix" is taken to have the common meaning of the term
"Sparse" combined with the common computer science meaning of the term
"Matrix", a two dimensional array of elements. Thus a "Sparse Matrix" is to
include every means of storing any two dimensional array of elements such
that the number of elements stored is significantly less than the number of
columns times the number of rows in the matrix. This includes, but, is not
limited to storing a two dimensional array as a single dimensional array.

Now I am exactly at the point at which I have tried to get some hints about
how and what the method described in the patent does - up to now in vain.
Sorry, I don't understand the claims - maybe because the language used sounds
alien to me being not able to grasp what DFA or a 'Sparse Matrix' is for in
we take _one_ CPU and process the entire graphical information of a screenshot
in parallel (on multiple CPUs? or what does it mean in parallel) what does
speed up the process of recognition of screen elements letting all other
already known methods miles behind.
I would be glad to hear here what exactly is, what has not been done before
and is covered by the patent - best from experts in the area of OCR which seem
to stay away from this thread yet.
From reading the comp.compression I know how many US patents exist on methods
without any sense and practical use granted often because of lack of expertize
to reject them or knowledge about prior art.
So the only way to see oneself if a patented method makes sense is to
understand it - because it is not enough to know that a patent was granted to
assume it is real - as it can happen that the only purpose of a patent is to
get fundings for further research.
An online demonstration of the technology would sure help here as anyone could
see with own eyes if it works and how long does it take to get a response
after a screenshot was submitted.
Claudio Grondi

The working prototype is not yet ready for formal review. Two things that are
known with certainty are:
(1) 100% accuracy can be achieved with fonts as small as 6 points at 96 DPI
screen resolution.
(2) This processing will take between 1/10 second and one second per 1280 * 1024
screen on a 2.4 Ghz single processor computer without hyperthreading.

Milind Joshi

2006-09-20 23:26:38 UTC

As far as I know, Microsoft's Accessibility API captures text quite
easily and accurately. We wrote a progrfam several years ago that does
this, and its not that hard to do.

It doesn't recognize images, but it can read just about any screen text
you can throw at it, and font and size doesn't matter...

In what way is SeeScreen different?

Milind Joshi
IDEA TECHNOSOFT INC.
http://www.ideatechnosoft.com

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

Peter Olcott

2006-09-21 01:14:16 UTC

Post by Milind Joshi
As far as I know, Microsoft's Accessibility API captures text quite
easily and accurately. We wrote a progrfam several years ago that does
this, and its not that hard to do.
It doesn't recognize images, but it can read just about any screen text
you can throw at it, and font and size doesn't matter...
In what way is SeeScreen different?

It can recognize text embedded within an image.

It handles all the cases where accessibility has not been provided for. It also
can recognize images. It is estimated that this technology will be able to
recognize thousands of arbitrary graphical objects from millions of alternatives
in less than one second. The real numbers on this performance will be available
very soon.

One of the main advantages of this technology is that it provides the means to
make GUI scripting much easier, and available to virtually any application or
combinations of applications, desktop, command line, or web. Now for the first
time, a script can ALWAYS see where to click the mouse. This enables automating
applications that require mouse input. It is even expected that this technology
will be able to play and win live action video games. This same capability could
have alternative uses.

Post by Milind Joshi
Milind Joshi
IDEA TECHNOSOFT INC.
http://www.ideatechnosoft.com

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

Milind Joshi

2006-09-26 17:36:10 UTC

Sounds like it sure has applications!

It can recognize text embedded within an image.
It handles all the cases where accessibility has not been provided for. It also
can recognize images. It is estimated that this technology will be able to
recognize thousands of arbitrary graphical objects from millions of alternatives
in less than one second. The real numbers on this performance will be available
very soon.
One of the main advantages of this technology is that it provides the means to
make GUI scripting much easier, and available to virtually any application or
combinations of applications, desktop, command line, or web. Now for the first
time, a script can ALWAYS see where to click the mouse. This enables automating
applications that require mouse input. It is even expected that this technology
will be able to play and win live action video games. This same capability could
have alternative uses.

Post by Milind Joshi
Milind Joshi
IDEA TECHNOSOFT INC.
http://www.ideatechnosoft.com

Post by Peter Olcott
Is there any OCR software that can approximate 100% accuracy at 96 DPI? What is
the most accurate OCR available that can process text from the display screen?

Peter Olcott

2006-10-03 02:43:00 UTC

Are their any applications of this technology in your business? One of the
applications that was suggested when I asked for suggestions in another forum
was data scanning/mining applications.

Post by Milind Joshi
Sounds like it sure has applications!