Introducing Fuzzy Layers For Deep Learning: Stanton R. Price, Steven R. Price, Derek T. Anderson
Abstract—Many state-of-the-art technologies developed in recent years have been influenced by machine learning to some extent. Most popular at the time of this writing are artificial intelligence methodologies that fall under the umbrella of deep learning. Deep learning has been shown across many applications to be extremely powerful and capable of handling problems that possess great complexity and difficulty. In this work, we introduce a new layer to deep learning: the fuzzy layer. Traditionally, the network architecture of neural networks is composed of an input layer, some combination of hidden layers, and an output layer. We propose the introduction of fuzzy layers into the deep learning architecture to exploit the powerful aggregation properties expressed through fuzzy methodologies, such as the Choquet and Sugeno fuzzy integrals. To date, fuzzy approaches taken to deep learning have been through the application of various fusion strategies at the decision level to aggregate outputs from state-of-the-art pre-trained models, e.g., AlexNet, VGG16, GoogLeNet, Inception-v3, and ResNet-18. While these strategies have been shown to improve accuracy for image classification tasks, none have explored the use of fuzzified intermediate, or hidden, layers. Herein, we present a new deep learning strategy that incorporates fuzzy strategies into the deep learning architecture, focused on the application of semantic segmentation using per-pixel classification. Experiments are conducted on a benchmark data set as well as a data set collected via an unmanned aerial system at a U.S. Army test site for the task of automatic road segmentation, and preliminary results are promising.

Index Terms—fuzzy layers, deep learning, fuzzy neural nets, semantic segmentation, fuzzy measure, fuzzy integrals

This work was partially supported under the Maneuver in Complex Environments R&D program to support the U.S. Army ERDC. This effort is also based on work supported by the Defense Threat Reduction Agency/Joint Improvised-Threat Defeat Organization (DTRA/JIDO). Any use of trade names is for descriptive purposes only and does not imply endorsement by the U.S. Government. Permission to publish was granted by Director, Geotechnical and Structures Laboratory, U.S. Army ERDC. Approved for public release; distribution is unlimited.

I. INTRODUCTION

Artificial intelligence (AI) has emerged over the past decade as one of the most promising technologies for advancing mankind in a multitude of ways, from medicine discovery and disease diagnostics to autonomous vehicles, semantic segmentation, and personal assistants. For many years there has been a desire to create computer algorithms/machines that are able to replace or assist humans in signal understanding for tasks such as automatic buried explosive hazard detection, vehicle navigation, object recognition, and object tracking. AI has grown to loosely encompass a number of state-of-the-art technologies across many fields, e.g., pattern recognition, machine learning (ML), neural networks (NNs), computational intelligence, evolutionary computation, and so on. Recently, much buzz has surrounded deep learning (DL) for its ability to provide desirable results for a number of different applications. AI has been shown to have great potential for finding optimized solutions to various problems across multiple domains. This technology has been heavily researched for computer vision applications [1], [2], speech translation [3], [4], and optimization tasks [5], [6]. Herein, the focus is on an extremely relevant and popular branch of AI: deep learning. DL has achieved much success in recent years on computer vision applications and has benefited from a surge of attention aimed at advancing its theory so that it generalizes to many problems. Part of DL’s recent resurgence stems from its network architectures being well suited to processing on powerful, highly parallelized GPUs. This has allowed for extremely complex and deep network architectures to be developed that were infeasible to implement on older compute technologies.

Fusion is a powerful technique used to combine information from different sources. These sources could be different extracted features, decisions, sensor outputs, etc., as well as different combinations thereof. Fusion methodologies often strive to improve system performance by combining information in a beneficial way that gives the system more discriminatory power in some form. This could be through the realization of more robust features that generalize well from one domain to another. For the task of classification, fusion could be used to combine multiple decision makers, e.g., classifiers, to improve overall accuracy. Fusion is a very rich and powerful technique that, when implemented appropriately, can lead to major algorithm improvements. Fusion is commonly associated with fuzzy techniques, as Fuzzy Logic [7], [8] naturally lends itself to gracefully considering data with varying degrees of belief. Ensemble approaches [9] are also commonly used for fusion tasks. Generally, most fusion strategies attempt to assign weights that encode the significance, or importance, of the different information sources, and these weights are the driving mechanism behind the fused result. Historically, weights used when fusing multiple information sources are either human-derived or found via an optimization function/strategy such as group lasso [10], [11]. However, there has been little-to-no research done on utilizing DL for
Fig. 3. Road masks shown (right column) for two sample images.