Automated Feature Extraction with Machine Learning and Image Processing

PD Stefan Bosse

University of Siegen - Dept. Maschinenbau
University of Bremen - Dept. Mathematics and Computer Science

1 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features -

Data and Data Features

Metrics and taxonomy of Data

Features of Data

Analysis of Data

2 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data

Data

In general, data and their values can be divided into:

Scalar values, such as temperature, age, etc.
Series of scalar values, such as time series
Vector and matrix values such as images
Composite data, i.e. data structures (records)
Temporal-spatial data, i.e. time-dependent spatial data series, D={D(p,t)={d(p)_i}} with i = {1,2,3,..,t}, p=⟨x,y,..⟩

Data have dimensionality 𝕏^N

The values of 𝕏 are a dimension from the discrete number set ℕ, real number set ℝ, and the time scale 𝕋 or any categorical value sets 𝕊 (or subsets thereof), e.g., 𝕏=ℝ × ℝ × ℕ.

3 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data Reduction

Data Reduction

The aim of data analysis is to reduce input data in terms of size and dimensionality:

$P(X^N): X^N \rightarrow Y^M\\ |Y|<|X|, M<N$

Materials science, metrology, and construction engineering uses:

Commonly metric input variables;
Often metric or categorical output variables (incl. Boolean variables)

4 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data Reduction

Data Reduction

function isRaining(temp,sunrad,moisture) = {
  if (temp < 0)                    FALSE
  else if (temp > 40)              FALSE
  else if ((sunrad-moisture) > 30) FALSE
  else                             TRUE
}

A R example from measurement technology with a data reduction function ℝ³ → 𝔹

5 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data classes

Data classes

Numerical and Metric values: These are values that are countable and where you can meaningfully define relations (such as smaller or larger), i.e. for all real and integers.

Examples: temperature, length, density, pore size, elongation, force, location, time

Categorical values: These are symbolic values for which either no (meaningful) order relation exists or where at least no differences can be formed.

Examples: nationality, color names (red < yellow???), Damage type, characteristic feature (anomaly?)

6 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data classes

m = 1
m = [1.0,1.5,2.5]
c = 'A'
c = ['A','B','A']
c = [TRUE,FALSE,TRUE]
c = factor(m,levels=[1,1.5,2,2.5],labels=['A','B','C','D'])

R examples of numerical and categorical values and conversion (factorization)

7 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data classes

Scaling of numerical values

Interval scaled: For this type of attributes, only differences (addition or subtraction) make sense. For example, the temperature measured in °C or °F is interval scaled. If it is 20 °C on one day and 10 °C on the following day, it makes sense to talk about a temperature drop of 10 °C, but it does not make sense to say that it is twice as cold as the day before (C(K)∼K, but F(K)/∼K!!).

Ratio scaled: Here you can calculate both differences and ratios between values. For example, for age, one can say that someone who is 20 years old is twice as old as someone who is 10 years old, and 20 is > 10.

8 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data classes

Order relations

Nominal: The attribute values in the domain are unordered and therefore only equality comparisons make sense. That is, we can only check whether the value of the attribute is the same for two specific instances or not. For example, gender is a nominal attribute.

Ordinal: The attribute values are ordered and thus equality comparisons (is one value equal to another?) and relational comparisons (is one value smaller or larger than another?) are allowed, although it may not be possible to quantify the difference between the values!

9 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data Aggregations

Data Aggregations

Vectors (columns, one dimensional)
Lists (field record, one dimensional)
Matrices (two dimensional)
Arrays (multi dimensional)
Tables (data frames organized in rows and columns)

v = c(4)            v = [1.0,1.5,2.5]
v[1] = 1.2
l = list(a=1,b=2)   l = {a=1,b=2}   l={1.0,1.5,2.5}
l$a = 9
m = matrix(0,nrow=2,ncol=3)
m = [1,2,3;4,5,6]
a = array(0,dim=[3,2,4])
df = data.frame(a={1,2,3},b={3,4,5})

R examples of aggregated data)

10 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data classes (longitudinal)

Data classes (longitudinal)

Sensor and measurement data variables (both categorical and metric) can be further distinguished in:

Static: The variable s is not variable in time or is to be regarded as stationary (immutable) in a significant time interval t ∈ [t₀, t₁].

Dynamic: The variable s(t) is time-dependent and forms a data series (or time vector) s(t)={s₀,s₁,..s_t} in the case of discrete acquisition, i.e., we are talking about longitudinal data.

A digitized sensor signal is always discrete in time, but the physical variable that the sensor measures is continuous in time (note the sampling theorem)

11 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data

Data

Data sets as matrices

Data can be represented in matrix form as matrix D (analogy to table form) [1]:

12 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Data

The vector X is the set of all variables X_i and represent the columns of the matrix D:

$\vec{X} = (X_{1},X_{2},..,X_{d})$

Each row x_j is a record of the variable set X={X_i|i=1,d} with values x and represent an individual example, instance, experiment, entitie, object, and feature vector as a d-digit tuple, depending on the application and objective:

$\vec{d}_j = \vec{x}_j = (x_{j,1},x_{j,2},..,x_{j,d})$

df = data.frame(
  X1={'x1,1','x1,2','...'},
  X2={'x2,1','x2,2','...'},
  X3={'x3,1','x3,2','...'}
)
print(df)
      X1      X2     X3 == X
1  "x1,1" "x2,1" "x3,1"
2  "x1,2" "x2,2" "x3,2"
3  "..."  "..."  "..."

13 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Input and Output Variables

Input and Output Variables

The variable set is composed of input and output variables: X_xy=X ∩ Y
Sensors are commonly input variables X
Statements are output variables Y, i.e. results that can be derived from the input variables (by a function F):

$\vec{X}_{xy} = (X_{1},X_{2},..,X_{u},Y_{1},Y_{2},..,Y_{v}) \\ \vec{X} = (X_{1},X_{2},..,X_{u}) \\ \vec{Y} = (Y_{1},Y_{2},..,Y_{v}) \\ \vec{d}_j = (x_{j,1},x_{j,2},..,x_{j,u},y_{j,1},y_{j,2},..,y_{j,v}) \\ F(\vec{X}): \vec{X} \rightarrow \vec{Y},$

with u+v=d.

14 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Example of a data matrix

Example of a data matrix

Botanical data set with geometric (numerical) properties of a plant and categorical classification:

15 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Example of a data matrix

Measurement data set

Computed Strain-stress diagram

www.precifast.de/elastizitaetsmodul-e-modul

Measurement data from strain test

Strain [mm]	Force [kN]
0	0
0.1	0.2
0.2	0.7
0.3	1.5
0.4	1.7
0.5	1.9
0.6	2.0
0.7	0.2
0.8	-0.5

16 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Example of a data matrix

tt = data.frame(
  Strain = [0.0,0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8],
  Force  = [0.0,0.2,0.7,1.5,1.7,1.9,2.0,0.2,-0.5]
)

Measure data stored in a R data.frame

17 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Example of a data matrix

Attributes

The measured variables X₁ to X₄ are metric data variables, the variable X₅=y is a categorical variable!
The measured variables X₁ to X₄ (i.e. sensors) are called attributes because they are properties and descriptive variables of the target variable y.

High-dimensional Data

Images I=I(x,y[,z]) are commonly two- or three-dimensional spatial data, organised in rows and columns (and levels)
Spatiotemporal data T=T(x,y[,z],t) is commonly three- or four dimensional and organised in rows, columns (levels), and discrete time points t.

18 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Sensors

Sensors

Which sensors and measurement data do you know:

19 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Sensor

Sensor

Measurement
- Physical quantities such as temperature, strain, stress, time, absorption
- Merged survey variables (e.g. ensemble mean values, outliers, ..)
When measuring with sensors, a distinction is made between:
- Single or single measurements (single shot)
- Repeated measurements of the same physical quantity (averaging..)
- Series of measured values, especially time-resolved data series:
  D = {d₁,d₂,..,d_n}, where commonly Δt(d_i,d_i+1) is constant

20 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Sensor

Sensor

Socio-technical systems, surveys
- Survey variables (answers to questions) are sensors of individual people
- Merged survey variables (e.g. ensemble mean values) are sensors of groups of people
Generally available data
- Social networks and social media
- Databases of authorities, etc.

21 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Sensor model

Sensor model

A sensor is a transducer (indicator for a property that is not directly measurable)
A sensor therefore generally maps a physical quantity x to another quantity y:

$S(x): x \rightarrow y,K:correct(x\rightarrow y)$

There is usually a calibration function ${K}{\left({f},{x},{y}\right)}$
Examples are:
- Pressure → Voltage, Radiation → current, etc.
- Social networking → Numerical radius value, votes → Politics, i.e., Assignment of numbers to objects or events according to established rules

22 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Sensor data

Sensor data

Sensors S are data sources d of physical, sociological or other natural variables x that cannot be detected directly
The data values (numeric) will be in a definable interval

Knowledge of the value interval is important for later data processing, analysis, and machine learning!
Categorical values are also defined by a set

$S(x): x \rightarrow d \\ d \in [a,b] \Rightarrow \{v_0,v_1,..,v_i\}$

23 / 85

PD Stefan Bosse - AFEML - Module A: Data and Data Features - Sensor data

24 / 85

PD Stefan Bosse - AFEML - Module A: Measurement and sensory systems - Sensor data

Measurement and sensory systems

The origin of data for analysis and machine learning!

A sensor rarely comes alone.

25 / 85

PD Stefan Bosse - AFEML - Module A: Measurement and sensory systems - Measurement methods

Measurement methods

A distinction is made between two different measurement methods:

Passive measuring method (P): The sensory values are the result of an intrinsic property (e.g., density) or already existing external variables (temperature). The stimulus of the measurement is the component, the person, the environment.

Active measurement methods (A): There is an active stimulus whose response signal is detected by the sensor. An example is the ultrasonic measurement method with guided waves. The sensor signal is always dependent on the stimulus. In sociology, for example, the stimulus is a catalog of questions in a survey, the answers are the sensor variables.

26 / 85

PD Stefan Bosse - AFEML - Module A: Measurement and sensory systems - Measurement methods

Acoustic Emission measuring technologies can belong to both classes,

27 / 85

PD Stefan Bosse - AFEML - Module A: Measurement and sensory systems - Measurement methods

Acoustic Emission measuring technologies can belong to both classes,

Guided Ultrasonic Waves belong to class A, and

28 / 85

PD Stefan Bosse - AFEML - Module A: Measurement and sensory systems - Measurement methods

Acoustic Emission measuring technologies can belong to both classes,

Guided Ultrasonic Waves belong to class A, and

X-ray imaging belongs commonly only to class P.

29 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Measurement methods

Signal Features

Statistical Features
Spatial Features (Images, geometric features)
Frequency and spectral Features /time and space)
Differences to reference signals
Transformed Signals

30 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

Assumption: Data series

But any image can be transformed into a pixel data series, too!
Any column of a data table is a data series (but independent values and unordered!)

There is a data series d related to one variable x(from sensor s):

$\vec{{d}}={\left\lbrace{d}_{{1}},{d}_{{2}},\ldots,{d}_{{n}}\right\rbrace},{s}:{x}\to{d}$

31 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

Feature	Formula
Sample Size	${n}$
Extrema	$\min{\left({x}\right)},\max{\left({x}\right)}$
Sample Mean	$\overline{{x}}=\frac{{{\sum_{{{i}={0}}}^{{{n}}}}{x}_{{i}}}}{{n}}$
Standard Deviation	${s}=\sqrt{{\frac{{{\sum_{{{i}={0}}}^{{{n}}}}{\left({x}_{{i}}-\overline{{x}}\right)}^{{2}}}}{{n}}}}$
Sample Variance	${s}^{{2}}=\frac{{{\sum_{{{i}={0}}}^{{n}}}{\left({x}-\overline{{x}}_{{i}}\right)}^{{2}}}}{{n}}$

... and many more

32 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

use math
Force  = [0.0,0.2,0.7,1.5,1.7,1.9,2.0,0.2,-0.5]
statsForce = fivenum(Force)
statsForce$std = sd(Force)
cprint(statsForce)
{min : -0.5 , q1 : 0.2 , median : 0.7 , mean : 0.855 , 
 q3 : 1.7 , max : 2, sd: 0.93}

Statistical analysis of data series or vectors in R

33 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

Feature	Formula
N-th moment about point a, e.g., $a=\bar{x}$	$\mu_n \left( a \right) = \sum {\left( {x - a} \right)^n P\left( x \right)}$
Gaussian Distribution	$P(x) = \frac{1}{{\sigma \sqrt {2\pi } }}e^{{{ - \left( {x - \mu } \right)^2 } \mathord{\left/ {\vphantom {{ - \left( {x - \mu} \right)^2 } {2\sigma^2 }}} \right.} {2\sigma^2 }}}$
Fisher Skewness	$\gamma_1 = \frac{\mu_3}{\mu_2^{3/2}} = \frac{\mu_3}{\sigma^3}, \sigma=\sqrt \mu_2$

... and many more

34 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

use math
Force  = [0.0,0.2,0.7,1.5,1.7,1.9,2.0,0.2,-0.5]
mn = moment(Force,order=2,central=TRUE)
print(mn)

Higher order moment analysis of data series or vectors in R

35 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

Meaning of higher order moments (Wikipedia)

36 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

Statistical analysis is applied to the same static variable X with unordered values from repeated measurements of X under the same conditions

37 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

Statistical analysis is applied to the same static variable X with unordered values from repeated measurements of X under the same conditions

Statistical measures for data series (e.g., time-dependent) of dynamic variables with values from measurements under different conditions are not valid ("non-sense"). But statistical measures can be still used as signal features posing a correlation between the input signal and the target features (e.g., damages), e.g., the mean value or higher order moments.

38 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Statistical Features

Statistical Features

An ordered data series {d_i} can be considered as an ordered series of different variables {X_i}!

Finally, all statistical features create a new input vector (for ML) X^f derived from the original input variables X:

$\text{Stat}{\left({X}\right)}:{X}\to{X}^{{f}}\\ {X}={\left({X}_{{1}},..,{X}_{{i}}\right)},{X}^{{f}}={\left({{X}_{{1}}^{{f}}},..,{{X}_{{j}}^{{f}}}\right)}\text{},{i}\gg{j}$

39 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Image Features

Low Level

Histogram H(I)={h₁,..,h_k}, where each histogram variable represents the number of pixels within an intensity interval [i,i+Δ] (can be split into separate RGB histograms for colour images)
Average (mean) intensity I, noise (intensity distribution statistics)
Extrema intensities min(I), max(I)
Frequency spectrum F(I)={f₁,..,f_s}, where each frequency represents a wavenumber in the wave room
Intensity gradients and profiles along lines (axis)
Addition and subtraction of images (using, e.g., base-line reference images)

40 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Image Features

High Level

Intensity gradients
Edges
Geometrical figures
Object clusters
Regions-of-interest (ROI), defined by bounding boxes or closed polygons
Labelled and classified ROIs
Feature point markings
Threshold Binarization (dimensionality reduction and feature amplification)

41 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Transformations

Reduce Picture Dimension

A simple way to reduce the dimension of our feature vector is to decrease the size of the image with decimation (downsampling) by reducing the resolution of the image.

If the color component is not relevant, we can also convert pictures to grayscale to divide the number dimension by three.
Intensity homogenisation using transfer functions

A two-dimensional mathematical matrix is a grayscale image, a three-dimensional mathematical matrix is a color image.

42 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Color Spaces

RGB: Three channels per pixel for each color R(ed), G(reen), B(lue) providing the color intensity
RGBA: RGB with an additional alpha (tranparency) channel
Grayscale: One channel per pixel providing the intensity (average or luminescence)

Conversion from color to grayscale uses a specific color model transformation. Be careful.

43 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Color Spaces

Average RGB ⇒ Grayscale transformation

${I}{\left({x},{y}\right)}=\frac{{{R}{\left({x},{y}\right)}+{G}{\left({x},{y}\right)}+{B}{\left({x},{y}\right)}}}{{3}}$

More natural color weighted luma RGB ⇒ Grayscale transformation

${I}{\left({x},{y}\right)}={0.299}{R}{\left({x},{y}\right)}+{0.587}{G}{\left({x},{y}\right)}+{0.114}{B}{\left({x},{y}\right)}$

RGBA ⇒ Grayscale transformation

${I}{\left({x},{y}\right)}=\frac{{{f{{\left({R}{\left({x},{y}\right)}\right)}}}+{f{{\left({G}{\left({x},{y}\right)}\right)}}}+{f{{\left({B}{\left({x},{y}\right)}\right)}}}}}{{3}}\\ {f{{\left({i},{a}\right)}}}={\left({1}-{a}\right)}_{{k}}+{a}_{{i}}\\ {a}=\frac{{{A}{\left({x},{y}\right)}}}{{k}}\\ {k}={255}$

44 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Look-up Tables

Intesnity distributions can be transformed with continous functions e.g., an exponential gamma correction, or by using a look-up table.

A look-up table can be considered as a discrete mapping function f(x): x → y, whereby the index, i.e,, a specific row, is given by the (discrete) x value, and y is the value in the specific row.
Only meaningful for small and discrete intensity value ranges, e.g., 8 Bit [0,255]
Only rough approximation of an intensity transfer function with continous value distributions, but fast method!

45 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Look-up Tables

use plot,math,imager
vals = [1,3,5,6,7.5,8,8.5,9,9.5,10]
mylut = lut(vals,range=[0,9])
img = matrix(runif(100)*10,10,10)
img.isca = mylut(img)
plot(img,auto.scale=TRUE)
hist(img,breaks=20)
plot(img.isca,auto.scale=TRUE)
hist(img.isca,breaks=20)

LUT function in R(+) applied to a random matrix

46 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Histogram of Oriented Gradient

The HOG feature descriptor is a popular technique used in computer vision and image processing for detecting objects in digital images.

The HOG descriptor is a type of feature descriptor that encodes the shape and appearance of an object by computing the distribution of intensity gradients in an image.

47 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Histogram of an Image

use math,plot
img = matrix(runif(100),10,10)
plot(img,auto.scale=TRUE)
hist(img,ylim=[0,1])
img[img>0.5]=1
plot(img,auto.scale=TRUE)
hist(img,ylim=[0,1])

Histogram of a uniformly distributed random image and image binarization

48 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Intensity Homogenization

The intensity of an image can vary significantly across the spatial x-y plane, e.g., as a result of the measuring method and conditions.

Image processing and transformation algorithms can be sensitive to intensity inhomogeneity.
Algorithms:
- Histogram Equalization (HE), Brightness Preserving Bi-Histogram Equalization (BBHE)
- Geometrical Image Intensity Equalization
- Model-based (physical model of illumination)

49 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Microcracks Image

Intensity Profiles

50 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Histogram Equalization

https://docs.opencv.org/3.4/d4/d1b/tutorial_histogram_equalization.html

It is a method that improves the contrast in an image, in order to stretch out the intensity range.
From the image below, you can see that the pixels seem clustered around the middle of the available range of intensities.

51 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

https://github.com/YuAo/Accelerated-CLAHE

Histogram equalization (HE) is a method in image processing of contrast adjustment using the image's histogram.
This method usually increases the global contrast of many images, especially when the usable data of the image is represented by close contrast values.
Through this adjustment, the intensities can be better distributed on the histogram.

This allows for areas of lower local contrast to gain a higher contrast and attention in visual inspection.

52 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Histogram Equalization stretch out this range.
Equalization implies mapping one distribution (the given histogram) to another distribution (a wider and more uniform distribution of intensity values) so the intensity values are spread over the whole range.
To accomplish the equalization effect, the remapping should be the cumulative distribution function (cdf). For the histogram H(i), its cumulative distribution H_cd(i) is (N: Number of pixels):

${H}_{{{c}{d}}}{\left({i}\right)}=\frac{{\sum_{{{0}\le{j}<{i}}}{H}{\left({j}\right)}}}{{N}}$

Finally, we use a simple remapping procedure to obtain the intensity values of the equalized image:

${I}_{{{e}{q}}}{\left({x},{y}\right)}={H}_{{{c}{d}}}{\left({I}{\left({x},{y}\right)}\right)}$

53 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Cummulative Distribution Function (CDF)

use math,plot
m=matrix(runif(100),10,10)
h=hist(m,ylim=[0,1],breaks=20,plot=FALSE)
print(h$density)
cdf=vector('numeric',length(h$density))
for (i in 1:length(h$density)) {
  cdf[i]=sum(h$density[1:i])
}
plot(cdf,auto.scale=TRUE,main='CDF')

Higher order moment analysis of data series or vectors in R

54 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Spatial Image Intensity Equalization

This simple Histogram Equalization is not sensitive to spatial intensity inhomogeneities and variations! Spatial uniform intensity distributions are assumed!

Intensity variations can be a result of a statistical process or due to the measuring technology and conditions
- Variation can be considered as an overlay (addition) to the "real" measuring signal s(x,y)v(x,y)+n(x,y), and noise n
Methods based on a spatial filtering of the images use the assumption that the bias field (intensity inhomogeneity) consists of a low spatial frequency intensity variation ⇒ Applying a High-pass filter in the wavenumber space!?
Low-pass filtering methods can be used to extract non-uniformity

55 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Trivial Approach

Assumption:
1. There is only one axis in the image with low-frequency intensity variations due to inhomogeneous illumination
2. The image content has statistically averaged homogeneous, i.e., equally distributed (small) features like cracks
The mean image intensity I_mean(p) can be computed along a line l(p) (parametric equation, orientation by visual inspection along the strongest intensity variation/gradient) by using the average intensity along the perpendicular line at each point p:

${x}_{{l}}={x}_{{0}}+{a}{p}\\ {y}_{{l}}={y}_{{0}}+{b}{p}\\ {l}{\left({p}\right)}:{p}\to{\left({x},{y}\right)}\\ {l}_{{\bot}}{\left({p},{q}\right)}:{q}\to{\left({x},{y}\right)}$

56 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

(Left) Computing the average intensity I_avg(p) perpendicular to a line along the intensity gradient (Right) Correct all pixels perpendicular to the correction line with a equalization factor

57 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Contrast Limited Adaptive Histogram Equalization

https://github.com/YuAo/Accelerated-CLAHE

CLAHE (Contrast Limited Adaptive Histogram Equalization) is an algorithm for enhancing local contrast in images, and is frequently used in application areas like underwater photography, traffic control, astronomy, and medical imaging.

CLAHE can also be used in the tone mapping operation of displaying a HDR (High Dynamic Range) image.

58 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Adaptive histogram equalization (AHE) differs from ordinary histogram equalization in the respect that the adaptive method computes several histograms, each corresponding to a distinct section of the image, and uses them to redistribute the lightness values of the image.
It is therefore suitable for improving the local contrast and enhancing the definitions of edges in each region of an image.
AHE has a tendency to overamplify noise in relatively homogeneous regions of an image.
- A variant of adaptive histogram equalization called contrast limited adaptive histogram equalization (CLAHE) prevents this by limiting the amplification.

59 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Compute the neighborhood histogram for each pixel in the image.
Clip each histogram at a predefined value and redistribute the clipped histogram equally among all the histogram bins.
Compute the CDF (Cumulative Distribution Function) and transformation function for each pixel using the clipped histogram.
Apply the transformation function to each pixel to get the equalized image.

The basic CLAHE algorithm

60 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Frequency Transformation

Time-dependent signal s(t) can be transformed in the frequency space S(ω) by using a frequency transformation, e.g., Discrete Fourier Transformation (DFT):

${\left|{D}{F}{T}{\left({s}\right)}\right|}:{s}{\left({t}\right)}\to{S}{\left(\omega\right)}\\ {D}{F}{T}{\left({\left\lbrace{x}_{{n}}\right\rbrace}\right)}:{\left\lbrace{x}_{{n}}\right\rbrace}\to{\left\lbrace{X}_{{k}}\right\rbrace}\\ {X}_{{k}}=\sum_{{{0}\le{n}<{N}}}{x}_{{n}}{e}^{{\frac{{-{2}{i}\pi}}{{N}}{k}}}\\ {X}_{{k}}=\sum_{{{0}\le{n}<{N}}}{x}_{{n}}{\left({\cos{{\left(\frac{{{2}\pi}}{{N}}{k}{n}\right)}}}-{i}{\sin{{\left(\frac{{{2}\pi}}{{N}}{k}{n}\right)}}}\right)}$

61 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

The DFT transforms a series of complex numbers {x_n} into a sequence of complex numbers {X_k}.
- The transformation is reversible (as long as complex numbers, i.e., magnitude and phase, is preserved).
Low-, High-, and Bandpassfiltering can be performed by applying a mask function to the frequency distribution {X_k} and transforming back into time-space (blending in frequency space)

TU Graz, IVU_frequency_2017

62 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

2D DFT

Images can be transformed into the frequency space, too, called wavenumber space
A two-dimensional (2D) DFT is used (output is a matrix, too)

${I}{F}{\left({k},{l}\right)}=\sum_{{{0}\le{m}<{N}}}\sum_{{{0}\le{n}<{N}}}{I}{\left({m},{n}\right)}{e}^{{-{2}{i}\pi{\left({k}\frac{{m}}{{N}}+{l}\frac{{n}}{{N}}\right)}}}$

TU Graz, IVU_frequency_2017

63 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

The signal frequency distribution is symmetric!

TU Graz, IVU_frequency_2017

64 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Wavelet Decomposition

Disadvantage of Fourier transformations is the lost of the time or spatial information.

A solution can be the application of a moving window of size m ≪ n, with n as the sample size (time signal: number of time samples, image: width and height).
- But: The Fourier transformation delivers m/2 frequencies
- If the window size is lowered, the time or spatial resolution increases, but the frequency resolution decreases!

Wavelet decomposition is a way of breaking down a signal in both space and frequency. In the case of pictures, this means breaking down the image into its horizontal, vertical, and diagonal components.

65 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Wavelet Decomposition

Parida et al.,2017 Decomposition of an image 2-D discrete wavelet transform with filter banks (2-D DWT)

66 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Wavelet Decomposition

Bosse et al., doi:10.3390/computers10030034 Example of a DWT signal decomposition of a US time-dependent signal

67 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

An image wavelet is a two-dimensional function Φ(x,y), and we need two.dimensional convolution operations. Time consuming!

Examples of 2D wavelets (Left) Haar (Right) Max Hat https://www.section.io/engineering-education/wavelet-transform-analysis-of-images-using-waveletanalyzer-toolbox-in-matlab/

68 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Wavelet Decomposition

Instead performing a 2-D wavelet convolution, we can apply the 1-D transformation to the rows and columns of images as separable 2-D transformations.
In most applications where wavelets are used for image processing, this approach is more practical due to the low computational complexity of separable transformations.
Each decomposition reduces the image size by a factor 2 in each dimension: DWT: M × M → M/2 × M/2;
The DWT decomposition can be repeated by using the ouput of the previous level

69 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Wavelet Decomposition

Wavelet 1st Level

Wavelet 2nd Level

https://www.section.io/engineering-education/wavelet-transform-analysis-of-images-using-waveletanalyzer-toolbox-in-matlab/

70 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Wavelet Decomposition and Reconstruction

Wavelet Image Decomposition

Wavelet Image Reconstruction

71 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Image Gradient

The (intensity) gradient of an image is the vector ∇I(x,y). It is characterized by a magnitude m and a direction φ in the image:

72 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Image Laplacian

Another important image transformation is the Laplacian of an image with intensity I(x,y) that is defined by:

Invariant to image rotations.
The laplacian is often used in image enhancement to increase contour effects

Higher sensitivity to noise than the gradient.

73 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Edge Detection

Two main strategies:

Gradient strategy: detection of the local extrema in the gradient direction.
Laplacian strategy: detection of zero-crossing.

These strategies rely on the fact that edges correspond to 0-order discontinuities of the intensity function.
The derivative computation requires a pre-filtering of the images.
- For instance: linear filtering for zero mean noises (e.g. white Gaussian noise and Gaussian filter) and non-linear filtering for impulse noise (median filter).
Since all edge detection results are easily affected by the noise in the image, it is essential to filter out the noise to prevent false detection caused by it. To smooth the image, a Gaussian filter kernel is convolved with the image.

74 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Edge Detection: Sobel Derivative Filter

The Sobel filter is a x- and y-sensitive gradient filter by using a convolution operation with two 3×3 kernels.. The x- and y-gradients are merged finally in one image.

use math,imager,plot
img.sobel <- sobelEdges(img,blur=2,gradient=TRUE)
print(summary(img.sobel))
plot(img.sobel,auto.scale=TRUE)

Sobel edge filter. The gaussian blurring is essential to reduce noise.

75 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Edge Detection: Canny Filter

The canny edge filter is a multi-stage algorithm. After denoising, intensity gradients of the image are computed ofr x- and y-direction, then a non-maximum suppression is applied, finally applying a hysteris threhold filtering.

use math,imager,plot
img.canny <- cannyEdges(img,t1=0,t2=50,blur=4)
print(summary(img.canny))
plot(img.canny,auto.scale=TRUE)

Canny edge filter. The gaussian blurring is essential to reduce noise. The edge detection thresholds t₁ and t₂ relate to the intensity gradient and must be set carefully. https://docs.opencv.org/4.x/da/d22/tutorial_py_canny.html

76 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Image Features

Kernel-based Convolution Algorithms

Convolution is using a kernel matrix to extract certain features from images.

A kernel is a matrix, which is shifted across the image and multiplied with the input pixels covered by the kernel matrix such that the output is transformed in a certain desirable manner. Watch this in action below.

https://towardsdatascience.com/types-of-convolution-kernels-simplified-f040cb307c37

77 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Geometric Transformations

Geometric Transformations

Simple geometrical operations of entire image or parts of the image are:

Translation;
Rotation around a specific position;
Scaling.

Advanced geometrical operations of entire image:

Linear affine transformations (including combinations of simple operations from above)
Image warping (using affine transformations)
Non-linear transformations for the correction of geometric distortions like Barrel and Pincushin ⇒ Fisheye Correction
Perspective transformations (perspective warping)

78 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Geometric Distortions

Geometric Distortions

Local geometric distortions caused by optical imaging (lense distortion) https://www.image-engineering.de/library/image-quality/factors/1062-distortion

79 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Measurement error and confidence

Measurement error and confidence

Systematic deviation (systematic error)

Deviation is caused by the sensor, environment, and sometimes physical processes
E.g.: incorrect calibration, constantly existing faults such as friction
Can only be eliminated by carefully examining the source of the error

Random deviation (Random or statistical error)

Deviation is caused by unavoidable, irregular disturbances
with repeated measurement, individual results differ from each other
Individual results vary by an average value

80 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Measurement error and confidence

Measurement error and confidence

Random error scattering

Random errors affect the accuracy of a measurement (noise).
Noise affects input and target feature computation (ML output)!
If one repeats a measurement of a quantity X which is falsified by pure random errors, the frequency distribution of the measured values is S = {s₁, s₂,...,s_n} by a mean value $\bar S$ given by a Gaussian distribution (the number of measurements N must be large).

81 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Measurement error and confidence

9 figgaussdist

Frequency distribution according to Gauss of measured values centered around an average value

82 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Examples: Statistical Analysis

Examples: Statistical Analysis

83 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Summary

Summary

Data can be classified into:
- Categorical variables and values
- Metric variables and values
- Temporal static variables
- Temporal dynamic variables (time series)
All sensor variables are subject to measurement errors:
- Noise
- Distortion
- Displacement (bias)
- Problem of reproducibility and systematic errors (environment!)
A (statistical) data analysis is often the first step in the ML workflow

84 / 85

PD Stefan Bosse - AFEML - Module A: Signal Features - Summary

Summary

There are different levels of sensor data features
- Aggregates like statstical measures
- Time- and freqency domain features
- Spatial features like edges in images or geometric properties
- Region-of-Interest Markinh
- Semantic features, i.e., classified features like damages

The signal feature selection and extraction is the first step to compute and detect target features like damages using data-driven models.

85 / 85