How do locally infrequent species influence numerical classification? A simulation study

A. Lengyel, J. Csiky, Z. Botta-Dukát

Research output: Contribution to journal › Article

5 Citations (Scopus)

Abstract

Phytosociological databases are important data sources for a broad scale of ecological investigations. Vegetation samples are traditionally managed and published in tabular format, allowing for handling of the vegetation data in various combinations. Such tables usually comprise relevés originated from the same locality, vegetation type and collected by the same investigator. Nevertheless, these relevés are usually affected by the same bias. In this paper, we demonstrate the importance of the effects acting at the level of the table (i.e., 'locally'), using the example of species removals from groups of relevés. We examine the effect of the removal of infrequent species on community classification in relation with several data set properties using simulated plot data sampled from simulated coenoclines. A data set comprised groups of relevés ('tables'), within which relevés are sampled from the same point of the coenocline. Classifications obtained after the removal or permutation of infrequent species occurrences from these tables, after the removal of rare species from randomised tables and without any treatment were compared to a reference classification based on gradient positions of the relevés. The results show that the removal of locally infrequent species helps to recognise the gradient pattern incorporated in the tabular arrangement of relevés if the arrangement of relevés among tables is in accordance with their gradient position. In cases when the grouping of relevés is irrelevant regarding the real underlying pattern, the species removal is disadvantageous. Testing between-table heterogeneity within a data set is an especially successful way of examination of biological relevance of the arrangement of relevés. We conclude that influence of table-level effects is mainly dependent on the pattern which is in accordance with the grouping of plots.

Original language: English
Pages (from-to): 64-71
Number of pages: 8
Journal: Community Ecology
Volume: 13
Issue number: 1
DOI: 10.1556/ComEc.13.2012.1.8
Publication status: Published - Jun 1 2012

Keywords

  • Coenocline
  • Multivariate analysis
  • Noise elimination
  • Phytosociology
  • Rare species
  • Vegetation databases

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Ecology

Cite this

How do locally infrequent species influence numerical classification? A simulation study. / Lengyel, A.; Csiky, J.; Botta-Dukát, Z.

In: Community Ecology, Vol. 13, No. 1, 01.06.2012, p. 64-71.

@article{d03fc5a0420e4476b135315104d576f9,
title = "How do locally infrequent species influence numerical classification? A simulation study",
abstract = "Phytosociological databases are important data sources for a broad scale of ecological investigations. Vegetation samples are traditionally managed and published in tabular format, allowing for handling of the vegetation data in various combinations. Such tables usually comprise relev{\'e}s originated from the same locality, vegetation type and collected by the same investigator. Nevertheless, these relev{\'e}s are usually affected by the same bias. In this paper, we demonstrate the importance of the effects acting at the level of the table (i.e., 'locally'), using the example of species removals from groups of relev{\'e}s. We examine the effect of the removal of infrequent species on community classification in relation with several data set properties using simulated plot data sampled from simulated coenoclines. A data set comprised groups of relev{\'e}s ('tables'), within which relev{\'e}s are sampled from the same point of the coenocline. Classifications obtained after the removal or permutation of infrequent species occurrences from these tables, after the removal of rare species from randomised tables and without any treatment were compared to a reference classification based on gradient positions of the relev{\'e}s. The results show that the removal of locally infrequent species helps to recognise the gradient pattern incorporated in the tabular arrangement of relev{\'e}s if the arrangement of relev{\'e}s among tables is in accordance with their gradient position. In cases when the grouping of relev{\'e}s is irrelevant regarding the real underlying pattern, the species removal is disadvantageous. Testing between-table heterogeneity within a data set is an especially successful way of examination of biological relevance of the arrangement of relev{\'e}s. We conclude that influence of table-level effects is mainly dependent on the pattern which is in accordance with the grouping of plots.",
keywords = "Coenocline, Multivariate analysis, Noise elimination, Phytosociology, Rare species, Vegetation databases",
author = "A. Lengyel and J. Csiky and Z. Botta-Duk{\'a}t",
year = "2012",
month = "6",
day = "1",
doi = "10.1556/ComEc.13.2012.1.8",
language = "English",
volume = "13",
pages = "64--71",
journal = "Community Ecology",
issn = "1585-8553",
publisher = "Akad{\'e}miai Kiad{\'o}",
number = "1"
}

TY - JOUR

T1 - How do locally infrequent species influence numerical classification? A simulation study

AU - Lengyel, A.

AU - Csiky, J.

AU - Botta-Dukát, Z.

PY - 2012/6/1

Y1 - 2012/6/1

AB - Phytosociological databases are important data sources for a broad scale of ecological investigations. Vegetation samples are traditionally managed and published in tabular format, allowing for handling of the vegetation data in various combinations. Such tables usually comprise relevés originated from the same locality, vegetation type and collected by the same investigator. Nevertheless, these relevés are usually affected by the same bias. In this paper, we demonstrate the importance of the effects acting at the level of the table (i.e., 'locally'), using the example of species removals from groups of relevés. We examine the effect of the removal of infrequent species on community classification in relation with several data set properties using simulated plot data sampled from simulated coenoclines. A data set comprised groups of relevés ('tables'), within which relevés are sampled from the same point of the coenocline. Classifications obtained after the removal or permutation of infrequent species occurrences from these tables, after the removal of rare species from randomised tables and without any treatment were compared to a reference classification based on gradient positions of the relevés. The results show that the removal of locally infrequent species helps to recognise the gradient pattern incorporated in the tabular arrangement of relevés if the arrangement of relevés among tables is in accordance with their gradient position. In cases when the grouping of relevés is irrelevant regarding the real underlying pattern, the species removal is disadvantageous. Testing between-table heterogeneity within a data set is an especially successful way of examination of biological relevance of the arrangement of relevés. We conclude that influence of table-level effects is mainly dependent on the pattern which is in accordance with the grouping of plots.

KW - Coenocline

KW - Multivariate analysis

KW - Noise elimination

KW - Phytosociology

KW - Rare species

KW - Vegetation databases

UR - http://www.scopus.com/inward/record.url?scp=84862115148&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84862115148&partnerID=8YFLogxK

U2 - 10.1556/ComEc.13.2012.1.8

DO - 10.1556/ComEc.13.2012.1.8

M3 - Article

VL - 13

SP - 64

EP - 71

JO - Community Ecology

JF - Community Ecology

SN - 1585-8553

IS - 1

ER -