Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Jobs of manager:
1. Data Analysis
2. Data reporting
Q: What is spreadsheet?
192.168.188/claroline
End term 60
Quiz 30
Assignment 10
Techniques:
1. Forecasting
2. Regression
3. Clustering
4. Correlation
Tools:
1. Excel
2. R
3. Python
4. SPSS
5. SAS
But you need to know business (domain expertise) also to make analysis.
Smart analytics:
Editor’s Choice
Shopper’s Choice
Personalized recommendation:
e-Commerce
One page or sheet in excel is a worksheet. Max. sheets that can be added: 255
Intersection of a row and a column is called a cell. The name of a cell is the
address itself.
=Count (Range): In a given range, how many cells are filled by a number.
=Counta(Range): In a given range, how many cells are filled (by text, junk
character, special character, numbers).
Circular Referencing error: Blue arrows shown.
How to come out of it? Go to file, options, formulas tab, check the textbox of
enable iterative calculations.
Key shortcuts:
Press F3: Opens the paste list dialog box (of whole workbook)
Ctrl+shift+2: time
Ctrl+shift+3: date
Ctrl+shift+4: currency
Ctrl+shift+5: %
Ctrl+shift+6: Exponential
Fill with no. from 1 to 10000: Choose option fill (below auto sum button) from
home tab.
It checks the immediate left column; upto which the immediate left column is
filled; rand function fills the current column.
Unit Price: Need a real number; value between 100 and 200.
Randbetween(100,199) + rand().
Revenue in two decimal places: If you apply formatting, if the sheet is sent to
someone place, the formatting is lost.
Caution: You should always use some formula rather than formatting.
Formatting doesn’t change the storing of the data in memory.
Eg: Round (32.673,0)= 33. More than 5, upper side and less than 5, lower side.
Round(32.673,2)=32.67
Roundup(): rounds to the next higher number upto the decimal position
specified by you without seeing the number.
Rounddown(): rounds to the next lower number upto the decimal position
specified by you without seeing the number.
Rename the cell from name bar: Do press enter key after changing the name
in name bar.
1. Name can be alphanumeric but it should start with an alphabet and not a
number. Eg: cat123 is a valid name [not valid because name of a cell] but
123cat is not.
2. Special symbols(space , !, @, $, #) should not be there. [“_” is allowed].
3. A cell name cannot be another cell address. (abc123 is a valid cell
address; so it can’t be used.)
Forumulatext(cell address)
Large(range, k): k is 2nd largest or 3rd largest and so on. K=1: max
Protect Excel:
How:
Hide a formula:
Circular Referencing:
Auditing Tools
2 most imp functions to extract information about the current excel sheet.
INFORMATION CATEGORY
INFO
Hide/unhide a sheet
Name of workbook:
Name of worksheet:
Any function that starts with ‘IS’ only returns two values: TRUE or FALSE.
Assignment 1:
Error Types:
7 types of errors:
1. #error (###): size of cell is insufficient; insufficient space: double click
on the border of two cells or adjust it manually.
2. #DIV/0!: Division by zero
3. Name error (#NAME?): Neither a built-in function nor user defined
variable.
Check the spelling or define a range.
4. #N/A: Not Applicable (formula doesn’t exist in the mentioned cell)
5. #VALUE: Doing operation on incompatible data types.
6. #REF: reference error
7. NULL error (#NULL!): if the output is null after doing some operation,
this error is returned.
Egs:
B1+B2 B3
Space works as an intersection operator in excel. It gives the intersection
value between two operands.
8. #NUM!: SQRT(-5): Number is not real; so this error comes. Or finding
log of a negative number.
Egs:
When computation is a very big number or answer is not defined.
=5^3125
=10^10245 #NUM!. Very big number which excel can’t calculate.
SQRT(-5):
Log(-ve number)
To calculate the length of characters in a cell:
=len(c2)
Trim: Removes the extra white spaces before starting character and after
the last character; and also in between two words, only one valid space
will be there.
Extract first name of every cell: all the characters before the occurrence
of first white space.
=LEFT(E2,FIND(" ",E2,1)-1)
Start_num: start finding the white space or text to be found after given
position.
Extract numbers/qty.:
If there are k white spaces, the kth occurrence is the last white space.
Concatenate(A2, “ “, B2)
REPT(“m”, 10): Repeats a given character for the given no. of times.
To unfreeze:
F3: Paste list; It gives the list of range names in the whole workbook.
In sheet1: Find function to check for ! mark. Then extract first name; that
will tell all sheets and then we can use counta.
IF Function
3 ARGUMENTS:
Logical Operators:
>
<
>=
<=
!=
Using IF function:
Checking eligible or not eligible for a loan:
If (age>25, “eligible”, “not eligible”)
Logical Conditions:
Assignment:
1. =IF(AND(Age>35,Owns_House="NO",Type_of_employment="GOV
T"),"Eligible", "Not Eligible")
OR
=IF(Age>35,IF(Owns_House="NO",IF(Type_of_employment="GOV
T","Eligible","Not Eligible"),"Not Eligible"),"Not Eligible")
2. =IF(OR(Annual_Income>1500000,Owns_House="YES"),"Eligible","
Not Eligible")
4.
5. =countif(range, Logical condition)
=countif(range, “Eligible”)
COUNTIFS:
N. Total no. of customers who are either male or their age is greater
than 40 years.
Soln:
=countif(Age, “>40”)+countif(Gender, “M”)- countifs(Age,
“>40”,Gender, “M”)
8. =countif(pan,”*T”)
11.
OR
=countif(pan_no, rept(“?”,10))
Soln: =max(ann_inc)
=countif(ann_inc,A2)
Criteria:
SUMIF: sum a certain range if and only if the given criterion is met.
Functions covered:
Info
Text
Count
Logical
Mathematical
CAUTION: the first column of look up table should contain look up value.
Exact match: FALSE
Col Index: from which column you want the data. It should always be a positive
number.
Select list of values; select data source and list will be created.
Error alert:
Warning:Yes or No
Information: OK
Input Message:
For vlookup on numeric value, the table should be in ascending order for the
first column in lookup table.
Hlookup:
=index(array, position)
=MATCH(AG11,A1:O1,0)
=vlookup(AE12,A1:O10514,MATCH(AG11,A1:O1,0),FALSE)
Offset:
Go up in no. of rows:-
Go right in columns: +
Go left in columns: -
Sum of 6 cells:
=SUM(OFFSET(D10,-5,-1,3,2))
A: Text to column get the last filled column in a given row [because it
contains the no. of copies]use offset
To get the last filled column:
=offset(B2,0,B2)
Indirect:
Always looks for text data which can be logically transformed into a valid cell
address.
[Because the sum function (for range) and summing all the cell addresses
change the cell address of the value. But indirect function works best.]
=INDIRECT(C$1 &"!"&$B2)
CAUTION: all the multiple workbooks should be opened in the background for
the function to work.
=INDIRECT("["&C$1&".xlsx]sheet1!"&$B2)
Address:
Lookup:
1enterctrl+shift+301-jan-00
Text function:
=TEXT(A3,"mmmm"): July
=TEXT(A3,”mmm”): Jul
Date, month and year given in different cells: to make that into date
Time functions:
Suppose a project starts today. The project takes 200 days to complete.
=today()+1: 21-07-2017
“0000000”: first zero stands for Monday, 2nd for tues, 7th zero Sunday.
List of activities given, start date, end date: Calculate no. of working days.
If 7 days a week,
If 5 day week:
=networkdays(A22,A26): It is inclusive of both the dates: start and end date. So,
no need to subtract 1.
If customized weekend:
=networkdays.intl(A22,A28, [weekend]):
Session 9
=datedif(start_date, end_date,”ym”): how many left over months after the last
completed year.
=datedif(start,end,”md”): how many days left over after the last completed
month.
What if analysis: Decision making tool in Excel.
1. Data Table: Used to study the effect of changing input variable on output
variable.
2 variants:
a) One-input data table: Only one input variable is changing and effect
can be seen on n output variables. [1:N]
b) Two-input data table: 2 input variables are changing and we can see
the effect on 1 output variable. [2:1]
2. Goal seek: Output variable is given and we want to see the effect on
input variable.
Working in dt worksheet,
Map revenue to D5, VC to D6 and so on. [These are my output variables]
Q: How many cups of tea should I sell that I get break-even point.
A: Data Goal seek
Set cell: profit (D7)
To value: 0 [Bcz break even, profit=0]
By changing: demand (D2)
Scenario Manager:
Year 1 price 20 10 5
Output variables:
NPV
Name: best
Press Add
Name: worst
Web page
Access database
Text file
Session 12
3 areas:
VALUE ZONE:
Unique value of that particular column will be displayed. If there are duplicate
values in that column, only the unique value is displayed.
Rows: Store
3: Product category for each product and store pair the average revenue
earned?
Store
Count (order_number)
A: It is fixed with respect to one point. Here the cell is A3 to which it is fixed.
Hierarchy of day:
Days WeekMonthQuarterYear
Hierarchy of country:
CityDistrictStateRegionCountry
Product category is higher in hierarchy than product. So, change their order in
row/column zone. It will give good presentation.
Rows: date
In Grouping dialog box, start date and end date is already filled.
N. Last year, during Mega sale in Diwali days, my server got down.
Analyse my weekly traffic now.
Rows: Age
Start: 18
End: 29
By: 12
CAUTION: Ungroup.
11. Customers interact with the store multiple times [physical store or
website]. In which interaction he has spent max value?
13. Last visit date, Times he visited the store [Frequency], Total amount
spent by him.
14. Total revenue by each store for different products and payment was
received by cash.
Caution 2: Right click on the header of slicer/timeline and click clear filter of
slicer and timeline.
17. What % of total revenue comes from Bhubaneshwar store by body care
product.
% of Grand Total.
18. Of the total revenue earned from body care, what percentage of
revenue comes from Patna Store?
Rows: Quarter
Select the three items by pressing ctrl key. Right click group.
Group 1
In pivot table,
ColumnFields
If total revenue earned from customer is > 80000, then give him gift
voucher=10% of his revenue amount else zero.
It doesn’t affect the original data; only added as field in pivot table.
Cross-selling of a product.
List formulas
New row
New column
Enter formula in one cell in the column; the whole column gets populated
automatically.
Consolidate
Fn: sum
Reference: select the data part in east and west workbooks respectively.
Add reference
Create link from data source
Conditional Formatting
New Rule:
2. Select the entire row: so, select the whole data here.
=($F2+$G2+$H2)>=$K$6
Even if I now change the target value in K6 cell, the rule automatically
adjusts according to it.
3. Color only the name of the employee:
Select only employee column from cell A2
=mod(column(),2)
11.Alternate row color:
=mod(row(),2)
Color in green the max value of respective columns & min in red
6 worksheets