Data Science: Mounick.V.Gowda ENG18CA0028
Data Science: Mounick.V.Gowda ENG18CA0028
MOUNICK.V.GOWDA
ENG18CA0028
TABEL OF CONTENTES
• WHAT IS DATA SCIENCE?
• WHY DATA SCIENCE?
• DATA SCIENCE COMPONENTS
• DATA SCIENCE PROCESS
• DATA SCIENCE JOB ROLES
• TOOLS OF DATA SCIENCE
• CHALLENGES OF DATA SCIENCE TECHNOLOGY
• SUMMARY AND REFERENCES
WHAT IS DATA SCIENCE?
• Data is the oil for today's world. With the right tools, technologies, algorithms, we can use data and
convert it into a distinctive business advantage
• Data Science can help you to detect fraud using advanced machine learning algorithms
• It helps you to prevent any significant monetary losses
• Allows to build intelligence ability in machines
• You can perform sentiment analysis to gauge customer brand loyalty
• It enables you to take better and faster decisions
• Helps you to recommend the right product to the right customer to enhance your business
DATA SCIENCE COMPONENTS
Statistics:
• Statistics is the most critical unit of Data Science basics. It is the method or science of collecting and analyzing numerical data
in large quantities to get useful insights.
Visualization:
• Visualization technique helps you to access huge amounts of data in easy to understand and digestible visuals.
Machine Learning:
• Machine Learning explores the building and study of algorithms which learn to make predictions about unforeseen/future data.
Deep Learning:
• Deep Learning method is new machine learning research where the algorithm selects the analysis model to follow.
DATA SCIENCE PROCESS
1. Discovery:
• Discovery step involves acquiring data from all
the identified internal & external sources which
helps you to answer the business question.
The data can be:
• Logs from webservers
• Data gathered from social media
• Census datasets
• Data streamed from online sources using APIs
2. Preparation:
• Data can have lots of inconsistencies like
missing value, blank columns, incorrect data
format which needs to be cleaned. You need to
process, explore, and condition data before
modeling. The cleaner your data, the better are
your predictions.
3. Model Planning:
• In this stage, you need to determine the method 5. Operationalize:
and technique to draw the relation between input
• In this stage, you deliver the final baselined
variables. Planning for a model is performed by
using different statistical formulas and model with reports, code, and technical
visualization tools. SQL analysis services, R, and documents. Model is deployed into a real-time
SAS/access are some of the tools used for this production environment after thorough testing.
purpose.
6. Communicate
4. Model Building: Results
• In this step, the actual model building process • In this stage, the key findings are communicated
starts. Here, Data scientist distributes datasets for to all stakeholders. This helps you to decide if
training and testing. Techniques like association, the results of the project are a success or a failure
classification, and clustering are applied to the based on the inputs from the model.
training data set. The model once prepared is
tested against the "testing" dataset.
DATA SCIENCE JOBS ROLES
Business Analyst:
Role:
• This professional need to improves business processes. He/she as an intermediary between the business
executive team and IT department.
Languages:
• SQL, Tableau, Power BI and, Python
TOOLS FOR DATA SCIENCE
• Data Science is the area of study which involves extracting insights from vast amounts of data by the use of various scientific
methods, algorithms, and processes.
• Statistics, Visualization, Deep Learning, Machine Learning, are important Data Science concepts.
• Data Science Process goes through Discovery, Data Preparation, Model Planning, Model Building, Operationalize,
Communicate Results.
• Important Data Scientist job roles are: 1) Data Scientist 2) Data Engineer 3) Data Analyst 4) Statistician 5) Data Architect 6)
Data Admin 7) Business Analyst 8) Data/Analytics Manager
• R, SQL, Python, SaS, are essential Data science tools
REFERENCES
• Jeff Leek (12 December 2013). "The key word in "Data Science" is not Data, it is Science“
• https://www.edureka.co/blog/data-science-applications
• https://www.guru99.com/data-science-tutorial.html