## G3101 Basics of geological data treatment

Faculty of Science
Autumn 2004
Extent and Intensity
1/2. 3 credit(s) (fasci plus compl plus > 4). Type of Completion: graded credit.
Teacher(s)
RNDr. Petr Sulovský, Ph.D. (lecturer)
Mgr. Renata Čopjaková, Ph.D. (alternate examiner)
Guaranteed by
doc. RNDr. Rostislav Melichar, Dr.
Department of Geological Sciences - Earth Sciences Section - Faculty of Science
Contact Person: Běla Hrbková
Prerequisites
! G3100 Geological Data Treatment
Knowledge of mathematics on secondary school level
Course Enrolment Limitations
The course is only offered to the students of the study fields the course is directly associated with.
fields of study / plans the course is directly associated with
there are 44 fields of study the course is directly associated with, display
Course objectives
The curriculum is focused on the acquisition of theoretical background to statistical analysis of numerical data in geosciences and their practical treatment using Microsoft Excel. The students are acquainted with principles of probability, descriptive (exploratory)statistics, statistical inference, parametric and non-parametric testing of hypotheses, characterisation of multidimensional datasets by regression and correlation analysis, time-series analysis, multivariate methods of statistical analysis (cluster, discriminant, factor etc.).
Syllabus
• 1. Introduction. Teach-in. The notion of data, types of geological data. Stages of data analysis: Data acquisition; analysis and choice of data. Formalisation (codification and standardization) of data. Data recording and classification. Exploratory data analysis, graphical presentation, types of graphs used in geosciences. Factual interpretation and formulation of results závěrů.

2. History and present of statistics. Examples of usage of statistics in geology. Basic statistical terms: statistical unit, statistical variable (qualitative/ quantitative; ordinal; continuous / discrete; alternative), statistical population (unidimensional, multidimensional). Definition of probability, random variable.

3. Description of univariate data. Random sampling. Distribution of data in the population - freequency distribution. Frequency - absolute, relative, cummulative. Graphing the frequency distribution; probability paper.

4. Basic statistical parameters Median, quantiles, mode, range. Moments: arithmetic mean, variance, standard deviation, coefficient of variation), skewnes, curtosis. Geometrical mean. Harmonical mean.

5. Basic types of frequency distributions. Distributions - normal, log-normal, binomial aand Poisson, special types (t-, F, chi square distributions). Examples of geological phenomena and their frequency distributions. Statistical inference. Estimates of the parameters of population. Properties of estimates, consistence, accuracy, robustness.

6. Testing statistical hypotheses Basic terms and testing procedures. Errors of the first and second type. Goodness of fit tests, test of variance, significance tests of difference between means. Paired tests. Identification of outliers. Analysis of variance(one-way, two-way). Non-parametrical testing (test for randomness, Wilcoxon test, Mann-Whitney test)

7. Statistical description of relations between variables. Correlation analysis. Regression analysis (simple linear correlation, nonlinear correlation, multiple correlation.

8. Multivariate statistical methods. Discriminant analysis. Cluster analysis (hierarchical, nonhierarchical methods, fuzzy clustering). Factor analysis, principle components method.

9. Time series analysis Moving average method, Cox-Jenkins method, ARIMA.

Literature
• WONNACOTT, Thomas H. Statistika pro obchod a hospodářství. Praha: Victoria Publishing.,. 891 s. ISBN 80-85605-09-0. info
• LEPŠ, Jan and Petr ŠMILAUER. Multivariantní analýza ekologických dat. 2001. info
• ZVÁRA, Karel. Biostatistika. 1. vyd. Praha: Karolinum, 1998. 210 s. ISBN 8071847739. info
• Statistické zpracování experimentálních dat :v chonometrii, biometrii, ekonometrii a v dalších oborech přírodních , technických a společenských věd. Edited by Milan Meloun. 2. vyd. Praha: East Publishing, 1998. xxi, 839 s. ISBN 80-7219-003-2. info
• LEPŠ, Jan. Biostatistika. Vyd. 1. České Budějovice: Jihočeská universita, 1996. 165 s. ISBN 8070401540. info
• Moderní statistické metody pro životní prostředí. Edited by Jiří Militký. [1. vyd.]. Ostrava: Vysoká škola báňská - Technická univerzita Ostrava, 1996. 160 s. ISBN 80-7078-360-5. info
• BRÁZDIL, Rudolf. Statistické metody v geografii : cvičení. 3. vyd. Brno: Vydavatelství Masarykovy univerzity, 1995. 177 s. ISBN 8021012609. info
• BRÁZDIL, Rudolf, Miroslav KOLÁŘ and Pavel PROŠEK. Statistické metody v geografii (Statistical methods in geography). Brno: Masarykova univerzita Brno, 1993. 177 pp. ISBN 80-210-1260. info
• HANOUSEK, Jan and Pavel CHARAMZA. Moderní metody zpracování dat :matematická statistika pro každého. 1. vyd. Praha: Grada, 1992. 210 s. ISBN 80-85623-31-5. info
• ZVÁRA, Karel. Regresní analýza. Vyd. 1. Praha: Academia, 1989. 245 s. ISBN 8020001255. info
• SATTRAN, Vladimír. Numerická klasifikace rudních ložisek. Vyd. 1. Praha: Ústřední ústav geologický v Academii, 1979. 118 s., 1. info
• SATTRAN, Vladimír and Blahomil SOUKUP. Použití matematických metod v geologii. Vyd. 1. Praha: Ústřední ústav geologický v Academii, 1973. 153 s., ii. info
Assessment methods (in Czech)
Průběžné a závěrečný test znalostí a počítačového řešení úloh.
Language of instruction
Czech
Further comments (probably available only in Czech)
The course is taught once in two years.
Information on the per-term frequency of the course: Výuka bude probíhat v podzimním semestru 2005/2006.
The course is taught: every week.
The course is also listed under the following terms Autumn 2007 - for the purpose of the accreditation, Autumn 2003, Autumn 2005, Autumn 2007, Autumn 2011, Autumn 2011 - acreditation, Autumn 2013, Autumn 2014, Autumn 2015, Autumn 2019, Autumn 2020.
• Enrolment Statistics (Autumn 2004, recent)