Central Florida Memory
Collection
Browse All
Maps
Photographs
Postcards
Most Recent
More...
Advanced Search
Preferences
My Favorites
Help
Share
About the Project
Additional Resources
Credits & Contact Info
Partners
Tell Us What You Think
More Info...
Learn
Florida Stories
Teachers
Exhibits
More Info...
add to favorites
:
reference url
back to results
:
previous
:
next
DATA MINING METHODS FOR MALWARE DETECTION
Access this item.
Title
DATA
MINING
METHODS
FOR
MALWARE
DETECTION
Author
Siddiqui, Muazzam
Keywords
Data Mining
Malware Detection
Machine Learning
Classification
Instruction Sequences
Signature Extraction
Predictive Modeling
Supervised Learning
Unsupervised Learning
Feature Selection
Feature Reduction
Abstract
This
research
investigates
the
use
of
data
mining
methods
for
malware
(malicious
programs)
detection
and
proposed
a
framework
as an
alternative
to the
traditional
signature
detection
methods.
The
traditional
approaches
using
signatures
to
detect
malicious
programs
fails
for the
new
and
unknown
malwares
case
,
where
signatures
are not
available.
We
present
a
data
mining
framework
to
detect
malicious
programs.
We
collected
,
analyzed
and
processed
several
thousand
malicious
and
clean
programs
to
find
out
the
best
features
and
build
models
that
can
classify
a
given
program
into a
malware
or a
clean
class.
Our
research
is
closely
related
to
information
retrieval
and
classification
techniques
and
borrows
a
number
of
ideas
from the
field.
We
used
a
vector
space
model
to
represent
the
programs
in
our
collection.
Our
data
mining
framework
includes
two
separate
and
distinct
classes
of
experiments.
The
first
are the
supervised
learning
experiments
that
used
a
dataset
,
consisting
of
several
thousand
malicious
and
clean
program
samples
to
train
,
validate
and
test
, an
array
of
classifiers.
In the
second
class
of
experiments
,
we
proposed
using
sequential
association
analysis
for
feature
selection
and
automatic
signature
extraction.
With
our
experiments
,
we
were
able
to
achieve
as
high
as
98.4%
detection
rate
and as
low
as
1.9%
false
positive
rate
on
novel
malwares.
Adviser
Wang, Morgan
Publisher
University
of
Central
Florida
Degree
Ph.D.
Degree Discipline
Other
Degree Grantor
Sciences
Degree Program
Modeling and Simulation PhD
Graduation Date
2008-01-01
Type
Doctoral dissertation
Access Level
Public - Allow Worldwide Access
Release Date
2008-09-05
Repository
University Archives
Repository Collection
Electronic Theses and Dissertations
Identifier
CFE0002303
Access Link
http://purl.fcla.edu/fcla/etd/CFE0002303
add to favorites
:
reference url
back to results
:
previous
:
next
powered by CONTENTdm
®
|
contact us
^ to top ^
About
Partners
Contact Us
LSTA
IMLS