
Aggregation (linguistics)

From Wikipedia


Aggregation is a subtask of natural language generation that merges syntactic constituents (such as sentences and phrases). Aggregation is sometimes also performed at a conceptual level.


Examples

A simple example of syntactic aggregation is merging the two sentences "John went to the shop" and "John bought an apple" into the single sentence "John went to the shop and bought an apple".
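The merge described above can be sketched in a few lines of Java. This is only a toy illustration, assuming each sentence has already been split into a subject and a predicate; the class SharedSubjectAggregation and its Clause type are hypothetical names invented for this sketch and do not come from any library.

```java
import java.util.Optional;

final class SharedSubjectAggregation {

    /** A toy clause: a subject plus the rest of the sentence (hypothetical type). */
    static final class Clause {
        final String subject;
        final String predicate;

        Clause(String subject, String predicate) {
            this.subject = subject;
            this.predicate = predicate;
        }
    }

    /**
     * Merges two clauses into one sentence when they share a subject,
     * dropping the repeated subject from the second clause.
     * Returns an empty Optional when the subjects differ.
     */
    static Optional<String> aggregate(Clause first, Clause second) {
        if (!first.subject.equals(second.subject)) {
            return Optional.empty();
        }
        return Optional.of(first.subject + " " + first.predicate
                + " and " + second.predicate + ".");
    }

    public static void main(String[] args) {
        Clause went = new Clause("John", "went to the shop");
        Clause bought = new Clause("John", "bought an apple");
        // Prints: John went to the shop and bought an apple.
        System.out.println(aggregate(went, bought).orElse("(not aggregated)"));
    }
}
```

A real system would work on syntactic structures rather than strings, so that agreement, tense and punctuation are handled by the realiser.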

Syntactic aggregation can be much more complex than this. For example, aggregation can embed one of the constituents in the other; e.g., we can aggregate "John went to the shop" and "The shop was closed" into the sentence "John went to the shop, which was closed".
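The embedding case can be sketched in the same spirit. The following is a string-level approximation, assuming the second sentence is given as a subject and a predicate and that the first sentence ends with the same noun phrase; RelativeClauseAggregation and embed are hypothetical names used only for illustration.

```java
import java.util.Locale;

final class RelativeClauseAggregation {

    /**
     * Embeds the second clause as a relative clause when the main clause
     * ends with the noun phrase that is the subject of the second clause.
     * Otherwise the two sentences are kept separate.
     * Assumption: the shared noun phrase is not a person, so "which" fits.
     */
    static String embed(String mainClause, String secondSubject, String secondPredicate) {
        String main = mainClause.toLowerCase(Locale.ROOT);
        String nounPhrase = secondSubject.toLowerCase(Locale.ROOT);
        if (main.endsWith(" " + nounPhrase)) {
            return mainClause + ", which " + secondPredicate + ".";
        }
        return mainClause + ". " + secondSubject + " " + secondPredicate + ".";
    }

    public static void main(String[] args) {
        // Prints: John went to the shop, which was closed.
        System.out.println(embed("John went to the shop", "The shop", "was closed"));
    }
}
```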

From a pragmatic perspective, aggregating sentences together often suggests to the reader that these sentences are related to each other. If this is not the case, the reader may be confused. For example, someone who reads "John went to the shop and bought an apple" may infer that the apple was bought in the shop; if this is not the case, then these sentences should not be aggregated.

A simple example of conceptual aggregation is replacing "Saturday and Sunday" by "weekend".
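As a rough sketch, this kind of conceptual aggregation can be treated as replacing a group of concepts by a single covering concept. The code below hard-codes the one rule from the example; ConceptualAggregation and aggregateDays are hypothetical names, and a real system would presumably consult a domain ontology rather than a fixed rule.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class ConceptualAggregation {

    /**
     * Replaces "Saturday" and "Sunday" by the single concept "weekend",
     * but only when both days are present. Other days are kept in order.
     */
    static List<String> aggregateDays(List<String> days) {
        if (!days.contains("Saturday") || !days.contains("Sunday")) {
            return new ArrayList<>(days);
        }
        List<String> result = new ArrayList<>();
        boolean weekendAdded = false;
        for (String day : days) {
            boolean isWeekendDay = day.equals("Saturday") || day.equals("Sunday");
            if (!isWeekendDay) {
                result.add(day);
            } else if (!weekendAdded) {
                // Emit the covering concept once, at the first weekend day.
                result.add("weekend");
                weekendAdded = true;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Prints: [Friday, weekend]
        System.out.println(aggregateDays(Arrays.asList("Friday", "Saturday", "Sunday")));
    }
}
```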

Algorithms and Issues

Aggregation algorithms must do two things:

  • Decide when two constituents should be aggregated
  • Decide how two constituents should be aggregated, and create the aggregated structure
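One way to picture this split is a loop that takes the "when" decision and the "how" operation as two separate, pluggable functions. The sketch below is an assumption about how such a pipeline might be organised, not a description of any published algorithm; AggregationPipeline, sameFirstWord and dropRepeatedFirstWord are hypothetical names, and the first-word test is a deliberately crude stand-in for a real decision procedure.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BiPredicate;
import java.util.function.BinaryOperator;

final class AggregationPipeline {

    /** Walks the constituents in order, merging each one into its predecessor when allowed. */
    static <T> List<T> run(List<T> constituents, BiPredicate<T, T> when, BinaryOperator<T> how) {
        List<T> output = new ArrayList<>();
        for (T next : constituents) {
            int last = output.size() - 1;
            if (last >= 0 && when.test(output.get(last), next)) {
                output.set(last, how.apply(output.get(last), next)); // the "how" step
            } else {
                output.add(next); // the "when" step said no
            }
        }
        return output;
    }

    private static String firstWord(String s) {
        int space = s.indexOf(' ');
        return space < 0 ? s : s.substring(0, space);
    }

    /** Toy "when": aggregate only if both sentences start with the same word. */
    static boolean sameFirstWord(String a, String b) {
        return firstWord(a).equals(firstWord(b));
    }

    /** Toy "how": coordinate the sentences and drop the repeated first word. */
    static String dropRepeatedFirstWord(String a, String b) {
        return a + " and " + b.substring(b.indexOf(' ') + 1);
    }

    public static void main(String[] args) {
        List<String> sentences = Arrays.asList(
                "John went to the shop", "John bought an apple", "Mary stayed home");
        // Prints: [John went to the shop and bought an apple, Mary stayed home]
        System.out.println(run(sentences,
                AggregationPipeline::sameFirstWord,
                AggregationPipeline::dropRepeatedFirstWord));
    }
}
```

Keeping the two decisions separate means the "when" predicate can later take semantic relations, genre or reader literacy into account without touching the code that builds the aggregated structure.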

The first issue, deciding when to aggregate, is poorly understood. Aggregation decisions certainly depend on the semantic relations between the constituents, as mentioned above; they also depend on the genre (e.g., bureaucratic texts tend to be more aggregated than instruction manuals). They probably should depend on rhetorical and discourse structure [1]. The literacy level of the reader is also probably important (poor readers need shorter sentences) [2]. But there is no integrated model that brings all these factors together into a single algorithm.

With regard to the second issue, there have been some studies of different types of aggregation, and how they should be carried out. A good recent paper is Harbusch and Kempen [3], who describe several syntactic aggregation strategies and also include references to previous papers in this area. In their terminology, "John went to the shop and bought an apple" is an example of Forward Conjunction Reduction.

Much less is known about conceptual aggregation. Di Eugenio et al. [4] show how conceptual aggregation can be done in an intelligent tutoring system, and demonstrate that performing such aggregation makes the system more effective (and that conceptual aggregation makes a bigger impact than syntactic aggregation).

Software

Unfortunately, little software is available for performing aggregation. However, the simplenlg system [5] does include limited support for basic aggregation. For example, the following code causes simplenlg to print out "The man is hungry and buys an apple".

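<source lang="java">
// Two clauses that share the subject "the man"
SPhraseSpec s1 = new SPhraseSpec("the man", "be", "hungry");
SPhraseSpec s2 = new SPhraseSpec("the man", "buy", "an apple");

// Aggregate the two clauses into a single clause
SPhraseSpec result = ClauseAggregator.newInstance().apply(s1, s2);

// Realise the aggregated clause as text and print it
Realiser realiser = new Realiser();
System.out.println(realiser.realise(result));
</source>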

References

  1. D Scott and C de Souza (1990). Getting the Message Across in RST-based Text Generation. In Dale et al (eds), Current Research in Natural Language Generation. Academic Press.
  2. S Williams and E Reiter (2008). Generating basic skills reports for low-skilled readers. Natural Language Engineering 14:495-535.
  3. K Harbusch and G Kempen (2009). Generating clausal coordinate ellipsis multilingually: A uniform approach based on postediting. In Proc of ENLG-2009 28:105-144.
  4. B Di Eugenio, D Fossati, D Yu (2005). Aggregation improves learning: experiments in natural language generation for intelligent tutoring systems. In Proc of ACL-2005, pp 50–57.
  5. A Gatt and E Reiter (2009). SimpleNLG: A realisation engine for practical applications. In Proc of ENLG-2009.

 
