
Problem 2:

In this particular project, we are going to work on the inaugural corpus from NLTK in
Python. We will be looking at the following inaugural speeches of Presidents of the United
States of America:
1. President Franklin D. Roosevelt in 1941
2. President John F. Kennedy in 1961
3. President Richard Nixon in 1973
• Find the number of characters, words and sentences for the mentioned documents.
• Remove all the stopwords from all three speeches.
• Which word occurs the most number of times in each president's inaugural address? Mention the top three words. (after removing the stopwords)
• Plot the word cloud of each of the speeches. (after removing the stopwords)

Code Snippet to extract the three speeches:


"
import nltk
nltk.download('inaugural')
from nltk.corpus import inaugural
inaugural.fileids()
inaugural.raw('1941-Roosevelt.txt')
inaugural.raw('1961-Kennedy.txt')
inaugural.raw('1973-Nixon.txt')
"
Introduction:
NLTK provides everything from splitting paragraphs into sentences, splitting sentences into
words, and identifying parts of speech, to highlighting themes and even helping your machine
understand what the text is about.
Q1. Find the number of characters, words and sentences for the mentioned documents.
Answer: We import the nltk library and use inaugural.fileids() to access the speeches.

After loading the text files, we first count the total number of characters in each file
separately. Below is the code to count the characters in each file, with the output shown in
the screenshot.
# Number of characters in each text file:
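As a minimal sketch of this step (assuming the corpus was downloaded as in the snippet above), len() on the raw speech string gives the character count:

from nltk.corpus import inaugural

# character count: length of the raw speech text
for f in ['1941-Roosevelt.txt', '1961-Kennedy.txt', '1973-Nixon.txt']:
    print(f, 'characters:', len(inaugural.raw(f)))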

# Number of words in each text file:


Below we count the total number of words in each file separately.
We use split() to break the text into individual words on the spaces between them, and
len() to count the resulting words.
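A minimal sketch of this word count, again assuming the speeches are read from the inaugural corpus:

from nltk.corpus import inaugural

# word count: split the raw text on whitespace and count the pieces
for f in ['1941-Roosevelt.txt', '1961-Kennedy.txt', '1973-Nixon.txt']:
    words = inaugural.raw(f).split()
    print(f, 'words:', len(words))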
Output :

# Number of sentences in each text file:
Below we count the total number of sentences in each text file using a lambda function.
We load the data into a pd.DataFrame as a dictionary and then, with the lambda function,
check each token that ends with "." using the endswith() function. The code and output are
shown below.
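A minimal sketch of this sentence count, following the approach described above (the use of pandas and the file list are assumptions; counting tokens that end with "." is only a rough sentence count, since abbreviations also end with a period):

import pandas as pd
from nltk.corpus import inaugural

files = ['1941-Roosevelt.txt', '1961-Kennedy.txt', '1973-Nixon.txt']
# load the speeches into a DataFrame via a dictionary
df = pd.DataFrame({'speech': {f: inaugural.raw(f) for f in files}})
# count every token that ends with "." as one sentence
df['sentences'] = df['speech'].apply(
    lambda t: sum(1 for w in t.split() if w.endswith('.')))
print(df['sentences'])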

Q2. Remove all the stopwords from all the three speeches.

Answer: We use the imports from nltk.corpus import stopwords and
from nltk.tokenize import word_tokenize.
The stopword list gives us the predefined English stopwords to remove from each text file
separately, and word_tokenize splits each text into individual words so that every stopword
can be filtered out of the text.
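A minimal sketch of the stopword removal (lower-casing the text and keeping only alphabetic tokens are assumptions on my part, not stated in the report):

import nltk
nltk.download('stopwords')
nltk.download('punkt')
from nltk.corpus import inaugural, stopwords
from nltk.tokenize import word_tokenize

stop_words = set(stopwords.words('english'))
filtered = {}
for f in ['1941-Roosevelt.txt', '1961-Kennedy.txt', '1973-Nixon.txt']:
    tokens = word_tokenize(inaugural.raw(f).lower())
    # keep only alphabetic tokens that are not English stopwords
    filtered[f] = [w for w in tokens if w.isalpha() and w not in stop_words]
    print(f, 'tokens after stopword removal:', len(filtered[f]))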

Q3. Which word occurs the most number of times in his inaugural address for each
president? Mention the top three words. (after removing the stopwords)

Answer: We have already removed the stopwords in the previous code using the stopwords list.
Now we loop over the remaining words and count the total number of occurrences of each one.
From the Roosevelt file we see that the words below are the ones used most often in the
president's speech.
Top 3 words: Nation, Know, Spirit.
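As a sketch of this counting step, collections.Counter (a substitute for the explicit loop described above) can tally the stopword-free tokens from the previous sketch:

from collections import Counter

# 'filtered' is the dict of stopword-free token lists from the previous sketch
for f, tokens in filtered.items():
    print(f, 'top 3 words:', Counter(tokens).most_common(3))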

In Kennedy's speech we see the top 3 words in the output above.

In Nixon's speech we see the top 3 words in the output below.

Q4. Plot the word cloud of each of the speeches. (after removing the stopwords)
Answer: Word Cloud is a data visualization technique used for representing text data in
which the size of each word indicates its frequency or importance. Significant textual data
points can be highlighted using a word cloud. Word clouds are widely used for analysing data
from social network websites.
Here we create the word cloud for the Roosevelt speech after importing the wordcloud library.
We also make sure to remove any unwanted substrings appearing in the filtered data.
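A minimal sketch of this word cloud step (the wordcloud and matplotlib libraries are assumed to be installed; 'filtered' is the dict of stopword-free tokens from the earlier sketch):

import matplotlib.pyplot as plt
from wordcloud import WordCloud

# join the filtered tokens back into one string and build the cloud
text = ' '.join(filtered['1941-Roosevelt.txt'])
wc = WordCloud(width=800, height=400, background_color='white').generate(text)
plt.figure(figsize=(10, 5))
plt.imshow(wc, interpolation='bilinear')
plt.axis('off')
plt.show()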

Output: the image is cut off because it could not be captured in a single screenshot; the full
word cloud renders correctly in the Python output.

Word cloud for Kennedy speech

Output for Kennedy speech:

Word cloud for Nixon speech

