Software Scientific: the competitive edge of intelligence.

Machine Intelligence

Technology Comparison


This page compares our technology with other approaches in commercial use.

To err is human, but to really foul things up requires a computer.
Farmers' Almanac, 1978

Unlike many companies, Software Scientific is not a one-technology shop. Instead, we will deploy whatever technique is appropriate, often combining many to offset the weaknesses of one approach with the strengths of another.

Our APIs remain focused on the task in hand, rather than the mechanics of accomplishing that task.

Comparing the Summariser with other techniques.

Statistical systems

At this time, there are no known statistical methods for achieving the functionality which is available to Lectern users (the underlying technology is the Concept Engine). It is difficult to envisage how statistical methods could be extended to incorporate the functionality required.

Despite these limitations, statistical methods have been applied to some of the functions which are offered by the Concept Engine. For instance on the Internet, some search engines are starting to offer summaries. However, the claim being made quite explicitly here is that the Concept Engine technology produces better results than statistical techniques, and this can be demonstrated quite readily with simple tests.

Statistical methods operate by calculating word frequency and correlating between associated words; this does not equate very well to the way in which people use language, and the results are erratic. The Concept Engine understands the meaning of words, and gives more reliable results when summarising text.


Rule Based,
Deep Grammars,
& Computational Linguistic systems

Another approach that might seem reasonable at first sight is the Computational Linguistic approach, which uses many rules to 'understand' the text.

Such rule based systems are hardwired and they rely on numerous exceptions being built into them to cope with the anomalies of language. The core functionality would become increasingly unreliable as more sophisticated demands were put on it.

Furthermore, rule based methods usually rely on large dictionaries of noun and verb data; this makes the summarising process quite slow and not readily extendable to multiple languages.

Comparison Tables

In the following tables, our technology is in italics.

Concept Engine and Summariser

Abstracting
 SpeedMulti-lingualCan be query focused?Quality
Discussion flow analysisFastYesYes"Indicative" abstracts
StatisticalFastYesNoPoor
Deep grammarsSlowNoNoCan invert meaning by accident

Relevance Ranking
SpeedParallel multi-lingual"Find Similar"QualityMeaning aware
Discussion flow analysisFastYesYesExcellentYes
'Word Bag' approachFastNoNoPoorNo
StatisticalFastNoNoFairNo

Converting Natural Language to Boolean
SpeedAccuracyMachine overheadStaff overheadRelative cost
Concept EngineFastHighLowLowLow
ManualVery slowHigh for expert usersNoneExperts onlyHigh

Bullets

Free-text searching
SpeedRelevance rankingImagesOverhead
BulletsFastGoodYesLow (10-15%)
Boolean inverted indexFastPoorNoHigh (50-100%)
Thesaurus inverted indexFastFairNo High (70-125%)
N.L. inverted indexFastGoodNoHigh (90-150%)
Discussion flow analysisSlowExcellentNoNone (0%)
Neural Net /
Pattern Recognition
MediumVery poorYesVarious

Automatic Language Detection

SpeedAccuracyMachine overheadStaff overheadRelative cost
Auto DetectFastHighLowLowLow
N-gramsFastMediumLowLowLow
ManualVery slowVery highNoneHighHigh