Ans: Data processing is the process of skilfully organising data for the purpose of data
analysis and interpretation. Data processing can be done manually when the data collected is
limited, or it can be done mechanically when the collected data involve huge quantities. Data
processing is the intermediary stage between data collection and data analysis. It needs to be
planned at the stage of research design.
Research Findings
Drawing Conclusions
Recommendations
1. Editing
2. Coding
3. Classification
4. Tabulation
5. Graphic Presentation
1. EDITING:
Editing is the process of checking errors and omissions in data collection, and making
corrections, if required. Editing is required when:
There is inconsistency in responses given by the respondents.
Respondents may provide incorrect or false responses.
Some answers given by the respondents are vague or incomplete.
No responses are provided by the respondents for certain questions.
Types of Editing:
Field Editing
Central Editing
2. CODING:
It is the process of assigning codes to the various statements or questions in the
questionnaire. Coding is especially required when the sample size is large and a
large amount of data is collected from respondents. Coding facilitates proper tabulation
and analysis of data.
Types of Codes:
Numerical Codes
Alphabetical Codes
Alpha-Numerical Codes
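As a minimal sketch of coding, the Python snippet below assigns numerical codes to questionnaire answers. The code book, the answer categories and the function name `encode` are hypothetical illustrations, not taken from any particular study.

```python
# Hypothetical code book: each answer category gets a numerical code.
CODE_BOOK = {
    "gender": {"Male": 1, "Female": 2},
    "education": {"School": 1, "Graduate": 2, "Post-Graduate": 3},
}

def encode(response):
    """Replace each answer in one respondent's record with its numerical code."""
    return {question: CODE_BOOK[question][answer]
            for question, answer in response.items()}

coded = encode({"gender": "Female", "education": "Graduate"})
print(coded)
```

Alphabetical or alpha-numerical codes would work the same way; only the values stored in the code book change.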
3. CLASSIFICATION:
It is the process of grouping of collected data into different categories. Therefore,
coding is an element of classification. The classification can be according to different
categories: Age Group Wise, Gender Wise, Educational Level Wise, Income Group
Wise, Occupation Wise, etc.
Each of the categories can be further divided into sub-groups. For example: The age
group can be further divided into different categories such as: Children, Teenagers,
Young Adults, Middle Aged and Senior Citizens.
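The age-group classification can be sketched in Python as follows. The group names come from the text, but the age boundaries and the function name `classify_age` are assumptions chosen only for illustration.

```python
# Upper age limit (inclusive) for each group; these boundaries are assumptions.
AGE_GROUPS = [
    (12, "Children"),
    (19, "Teenagers"),
    (35, "Young Adults"),
    (59, "Middle Aged"),
]

def classify_age(age):
    """Return the age group a respondent belongs to."""
    for upper_limit, group in AGE_GROUPS:
        if age <= upper_limit:
            return group
    # Anyone older than the last boundary falls in the final group.
    return "Senior Citizens"

print([classify_age(age) for age in (8, 16, 28, 45, 70)])
```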
4. TABULATION:
It refers to transferring the classified data in a tabular format for the purpose of
analysis and interpretation. It involves sorting of data into different categories and
counting the number of responses that belong to each category.
Methods of Tabulation:
Manual Tabulation
Mechanical Tabulation
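Mechanical tabulation amounts to sorting responses into categories and counting them. A minimal Python sketch, using made-up responses and a hypothetical helper named `tabulate`:

```python
from collections import Counter

# Hypothetical classified responses to an "education level" question.
responses = ["Graduate", "School", "Graduate", "Post-Graduate", "Graduate", "School"]

def tabulate(values):
    """Count how many responses fall in each category."""
    return Counter(values)

table = tabulate(responses)

# Print a simple two-column frequency table, largest category first.
for category, count in table.most_common():
    print(f"{category:<15}{count}")
```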
5. GRAPHIC PRESENTATION:
The research data needs to be presented effectively for quick and clear understanding.
Bar graphs, pie charts, line graphs and other pictorial devices are an excellent means
to present the data.
Ans: A single value within the range of the data that is used to reveal the general tendency
and to represent the entire data is known as a measure of central tendency.
George Simpson and Fritz Kafka state that a measure of central tendency is a typical value
around which the other values congregate.
Croxton and Cowden define an average as a single value within the range of the data that is
used to represent all of the values in the series. Since the average is somewhere within the
range of data, it is sometimes called a measure of central value.
Objectives of Averaging:
It can be easily calculated in the case of distributions containing open-end class
intervals:
Sometimes, we are bound to use open-end classes for classification. Even in such
situations an average can be calculated very easily without any assumption regarding
such open-end classes.
MATHEMATICAL:
1. Arithmetic Mean
2. Geometric Mean
3. Harmonic Mean
POSITIONAL:
1. Median
2. Mode
1. ARITHMETIC MEAN:
It is the most popular measure of central tendency, as it is quick and easy to
compute. It refers to the value obtained by dividing the sum of the values
of all items by the total number of items.
Formula:
Arithmetic mean is calculated with the following formula:
$$\text{Arithmetic Mean} = \frac{x_1 + x_2 + \cdots + x_n}{n}$$
2. GEOMETRIC MEAN:
The geometric mean of any set of n numbers is the nth root of the product of the
numbers. If there are two items or values, the square root of the product of the two values
is the geometric mean; if there are three values, the cube root is the geometric mean,
and so on.
It is the most appropriate average to use when it is desired to give more weight
to smaller items and less weight to larger items.
3. HARMONIC MEAN:
The harmonic mean of a series is the reciprocal of the arithmetic average of the reciprocals
of the values of its various items.
$$\text{Harmonic Mean} = \frac{n}{\frac{1}{x_1} + \frac{1}{x_2} + \cdots + \frac{1}{x_n}}$$
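The three mathematical averages can be compared with a short Python sketch that follows the definitions above. The function names and sample values are illustrative; the geometric and harmonic means are assumed to be applied to positive values only.

```python
import math

def arithmetic_mean(values):
    """Sum of the values divided by the number of values."""
    return sum(values) / len(values)

def geometric_mean(values):
    """nth root of the product of n positive values."""
    return math.prod(values) ** (1 / len(values))

def harmonic_mean(values):
    """Reciprocal of the arithmetic mean of the reciprocals."""
    return len(values) / sum(1 / x for x in values)

data = [2, 8]  # illustrative values
print(arithmetic_mean(data), geometric_mean(data), harmonic_mean(data))
```

For the values 2 and 8 the arithmetic mean is 5, the geometric mean is 4 (the square root of 16) and the harmonic mean is 3.2, which shows how the geometric and harmonic means give less weight to the larger item.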
4. MEDIAN:
Median is the middle value of a series when the data of a series is arranged in
ascending or descending order. It divides the series in two equal parts.
5. MODE:
The mode is defined as the value of a variable which occurs most frequently. It is the
value which is repeated the greatest number of times, or has the highest frequency, in the series.
Croxton and Cowden define the mode of a distribution as the value at the point
around which the items tend to be most heavily concentrated.
A. M. Tuttle defines the mode as the value which has the highest frequency density in
its immediate neighbourhood.
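The two positional averages can be sketched in Python as below. The function names are illustrative. Averaging the two middle values of an even-sized series is a common convention assumed here, and if several values tie for the highest frequency this sketch simply returns one of them.

```python
from collections import Counter

def median(values):
    """Middle value of the series once it is arranged in ascending order."""
    ordered = sorted(values)
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        return ordered[middle]
    # Even number of items: take the average of the two middle values.
    return (ordered[middle - 1] + ordered[middle]) / 2

def mode(values):
    """Value that occurs with the highest frequency in the series."""
    return Counter(values).most_common(1)[0][0]

print(median([7, 3, 5]), mode([2, 3, 3, 5, 3, 2]))
```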
Q.3. Explain the different methods of determining correlation.
Ans: The methods of determining correlation between variables can be divided into two
groups:
Algebraic methods which include rank differences, product moment, least squares etc.
SCATTER DIAGRAM:
The pairs of values of X and Y are represented by dot points plotted on the graph paper. The
graph is called a Scatter Diagram.
When X and Y are meaningfully linked with each other, their scatter diagram may resemble
one of the standard scatter diagram patterns.
Spearman Rank Correlation Coefficient uses ranks to calculate correlation. It is also called the
correlation coefficient between ranks. It is sometimes denoted by rs.
E.g. A talent contest where 5 competitors are evaluated by 2 judges, A and B. Usually judges
award numerical scores to each contestant after his/her performance.
Spearman Rank Correlation Coefficient can indicate whether the judges agree with each other's
views as far as the talent of the contestants is concerned. This correlation coefficient can
indicate whether the judges are unanimous.
Interpretation of numerical values:
The numerical value of the correlation coefficient, rs, ranges between -1 and +1. The correlation
coefficient is a number indicating how the two sets of scores are related.
In general,
rs = 0 indicates no agreement
Assigning Ranks:
In order to compute Spearman Rank Correlation Coefficient, it is necessary that the data can
be ranked. Ranks are assigned separately for the 2 judges starting either from the highest or
from the lowest score.
Spearman Rank Correlation Coefficient tries to assess the relationship between ranks without
making any assumptions about the nature of their relationship. It is a non-parametric measure
of correlation.
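The talent-contest example can be worked through in Python. The sketch below uses the standard rank-difference formula rs = 1 - 6Σd² / (n(n² - 1)), where d is the difference between the two ranks given to a contestant. That formula is not stated in these notes and holds only when there are no tied scores. The scores and function names are hypothetical.

```python
def assign_ranks(scores):
    """Rank 1 goes to the highest score; assumes no tied scores."""
    ordered = sorted(scores, reverse=True)
    return [ordered.index(score) + 1 for score in scores]

def spearman_rs(scores_a, scores_b):
    """Spearman rank correlation via the rank-difference formula (no ties)."""
    ranks_a = assign_ranks(scores_a)
    ranks_b = assign_ranks(scores_b)
    n = len(scores_a)
    sum_d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - (6 * sum_d_squared) / (n * (n ** 2 - 1))

# Hypothetical scores awarded to 5 contestants by judges A and B.
judge_a = [8, 6, 9, 5, 7]
judge_b = [7, 5, 9, 6, 8]
rs = spearman_rs(judge_a, judge_b)
print(rs)
```

Here the ranks are (2, 4, 1, 5, 3) and (3, 5, 1, 4, 2), the squared rank differences add up to 4, and rs works out to 0.8, indicating strong but not perfect agreement between the judges.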
It is one of the measures of correlation which quantifies the strength as well as the direction of
such a relationship.
Two variables are said to be correlated if change in one variable is accompanied by change in
the other either in the same or reverse direction.
Q.4. Distinguish between regression and correlation as tools of analysis.
Ans: The two techniques are directed towards a common purpose of establishing the degree
and the direction of relationship between two or more variables, but the methods of doing so are
different. There are some basic differences in the two approaches, which have been
summarised below:
The first step is to state the research problem. The research problem needs to identify the
population of interest and the variables under investigation.
The population of research refers to the students, and the variables include the teaching
methods and the marks.
This step enables the researcher to define not only what is to be tested but also which variables
will be used in sample data collection. The type of variables, whether categorical, discrete or
continuous, further defines the statistical test which can be performed on the collected data.
The research problem or question is converted into a null hypothesis and an alternative
hypothesis. The hypotheses are stated in such a way that they are mutually exclusive, which
means that if one is true then the other must be false.
Null Hypothesis: It is a statement that declares the observed difference is due to chance. It is
the hypothesis the researcher hopes to reject or disprove.
It is clear that Null Hypothesis is a hypothesis of no difference. The main problem of testing
hypothesis is to accept or to reject the Null Hypothesis.
The Alternative Hypothesis specifies a definitive relationship between two variables. Only
one Alternative Hypothesis is tested against the Null Hypothesis.
3. Significance Level:
After formulating the hypothesis, the researcher must determine a certain level of
significance. The confidence with which a Null Hypothesis is accepted or rejected depends
on the level of significance.
4. Test Statistic:
The test statistic is a statistic used to test the Null Hypothesis. The researcher needs to
identify a test statistic that can be used to assess the truth of the Null Hypothesis. It is used
to decide whether the Null Hypothesis should be rejected or not.
The test statistic is calculated from the collected data. There are different types of test statistics.
Every statistical test reports the same kind of result. Based on the sample data, it gives a
p-value: the probability of observing data at least as extreme as the sample if the Null
Hypothesis were true. When the p-value is low, the sample data are unlikely under the Null
Hypothesis, which is evidence against it. When the p-value is high, it
suggests that the collected data are consistent with the Null Hypothesis.
The region of acceptance is a range of values. If the test statistic falls within the region of
acceptance, the Null Hypothesis is not rejected. The region of acceptance is defined so that
the chance of making a Type I error is equal to Alpha level of significance.
The set of values outside the region of acceptance is called Region of Rejection. If the test
statistic falls within the region of rejection, the Null Hypothesis is rejected. It is said that the
hypothesis has been rejected at the Alpha level of significance.
(Alpha is the probability of falsely rejecting a true Null Hypothesis that the researcher is
willing to accept.)
A hypothesis test may be one-tailed or two-tailed. Whether the test is one-sided or two-sided
depends upon the alternative hypothesis and the nature of the problem.
A test of a statistical hypothesis where the region of rejection is on only one side of the
sampling distribution is called a one-tailed test. In a one-tailed test, the region of rejection
of the Null Hypothesis falls on only one side of the sampling distribution curve.
Whether to apply a one-tailed or a two-tailed test depends upon the nature of the problem.
A one-tailed test is used when the researcher's interest is primarily on one side of the issue.
In a two-tailed test, the region of rejection of the Null Hypothesis falls on both sides of the
sampling distribution curve.
If the statistical analysis shows that the p-value is below the cut-off value (the level of
significance) we have set, we reject the null hypothesis and accept the alternative hypothesis.
If the p-value is above the cut-off value, we fail to reject the null hypothesis and cannot
accept the alternative hypothesis.
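The decision rule can be illustrated with a small Python sketch of a z-test on a sample mean. It assumes the population standard deviation is known and the test statistic follows the standard normal distribution. The function name `z_test`, the marks figures and the 5% level of significance are hypothetical choices, and the one-tailed branch covers only an upper-tailed alternative.

```python
from math import sqrt
from statistics import NormalDist

def z_test(sample_mean, mu0, sigma, n, alpha=0.05, tails=2):
    """Test H0: population mean = mu0, with known standard deviation sigma."""
    z = (sample_mean - mu0) / (sigma / sqrt(n))  # test statistic
    standard_normal = NormalDist()
    if tails == 2:
        # Region of rejection lies in both tails, alpha / 2 in each.
        critical = standard_normal.inv_cdf(1 - alpha / 2)
        p_value = 2 * (1 - standard_normal.cdf(abs(z)))
        reject = abs(z) > critical
    else:
        # Upper-tailed test: region of rejection lies in the right tail only.
        critical = standard_normal.inv_cdf(1 - alpha)
        p_value = 1 - standard_normal.cdf(z)
        reject = z > critical
    return {"z": z, "critical": critical, "p_value": p_value, "reject": reject}

# Hypothetical data: mean marks of 100 students taught by a new method,
# tested against an assumed population mean of 50 with sigma = 10.
result = z_test(sample_mean=52, mu0=50, sigma=10, n=100)
print(result)
```

With these figures the test statistic is 2.0, which falls outside the two-tailed region of acceptance (roughly -1.96 to +1.96), so the Null Hypothesis is rejected at the 5% level.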
Ans: A parametric test is a statistical test that depends on an assumption about the distribution
of data, namely that the data are normally distributed. The features that describe the normal
distribution of a population, such as its mean and standard deviation, are known as parameters.
Parametric analysis relies on the data being normally distributed so that an estimation of the
underlying population's parameters can be made. These can then be used to test the null
hypothesis. As only quantitative data can
have a normal distribution, it follows that parametric analysis can only be used on
quantitative data.
Provided they are appropriately used, parametric tests derive more information about the
whole population than non-parametric ones.
Ans: Chi Square is a measure which evaluates the extent to which a set of observed
frequencies of a sample deviates from the corresponding set of expected frequencies of the
sample. It is the measure of aggregate discrepancies between actual and expected
frequencies. It was first discovered by Helmert in 1875. Karl Pearson derived it independently
in 1900 and applied it as a test of goodness of fit. It is used as a test statistic in testing a
hypothesis that provides the theoretical frequencies with which observed frequencies are
compared.
If O denotes the observed frequency and E the corresponding expected frequency of a class
interval or cell, then we define Chi Square by the relation:
$$\chi^2 = \sum \frac{(O - E)^2}{E}$$
Any statistical test that uses the Chi Square distribution can be called Chi Square Test. It is
applicable both for large and small samples depending on the context.
E.g. Suppose a person wants to test the hypothesis that the success rate in a particular English
test is similar for students who studied in Private Schools and in Government Schools.
If we take a random sample of 80 students and record the type of school as well as the
success/failure status of each student, the Chi Square test can be applied to test the
hypothesis.
There are different purposes for the test. They are as follows:-
The Chi Square test for a single variance assumes that the population from which the
sample has been drawn is normal. This normality assumption need not hold for the chi square
goodness of fit test and the test for independence of attributes.
When implementing these two tests one has to ensure that the expected frequency in any cell is
not less than 5. If it is, then the cell has to be pooled with the preceding or succeeding cell so
that the expected frequency of the pooled cell is at least 5.
4. Calculate the difference between the observed and expected frequencies correspondingly.
6. Ascertain the appropriate value from the table at a particular level of significance.
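The school example can be carried through these steps in Python. The sketch below is a test of independence on a 2 x 2 table for 80 students. The observed counts are invented for illustration, the function name is hypothetical, and 3.841 is the usual table value for 1 degree of freedom at the 5% level of significance.

```python
def chi_square_independence(observed):
    """Return (chi-square statistic, expected frequencies, degrees of freedom)."""
    row_totals = [sum(row) for row in observed]
    column_totals = [sum(column) for column in zip(*observed)]
    grand_total = sum(row_totals)
    # Expected frequency of a cell = row total x column total / grand total.
    expected = [[r * c / grand_total for c in column_totals] for r in row_totals]
    # Sum (O - E)^2 / E over every cell of the table.
    chi_square = sum((o - e) ** 2 / e
                     for obs_row, exp_row in zip(observed, expected)
                     for o, e in zip(obs_row, exp_row))
    degrees_of_freedom = (len(row_totals) - 1) * (len(column_totals) - 1)
    return chi_square, expected, degrees_of_freedom

# Hypothetical counts: rows = Private, Government; columns = Success, Failure.
observed = [[30, 10],
            [20, 20]]
CRITICAL_VALUE = 3.841  # table value for 1 degree of freedom at the 5% level

chi_square, expected, df = chi_square_independence(observed)
print(chi_square, expected, df, chi_square > CRITICAL_VALUE)
```

Every expected frequency here is at least 5 (25 and 15 in each row), so no pooling is needed. The calculated value of about 5.33 exceeds 3.841, so for these invented counts the hypothesis of similar success rates would be rejected.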
Ans: The essential characteristics of a good research report are stated as follows:
1. Informative:
The research report must be informative. The research report must provide adequate
information to the concerned authorities to take appropriate decisions. Inadequate
information may not facilitate proper decision making on the part of the management.
2. Clarity:
The report must be written in simple and lucid language. The reader should find no difficulty
in understanding the contents of the report. The reader should be able to understand the
contents in the first reading itself. Technical language should be used only in exceptional
areas. Ambiguous words and phrases should be avoided in the reports.
3. Concise:
The report must be written briefly. Maximum information must be provided in minimum
words. As far as possible, lengthy reports must be avoided, because lengthy reports
are often confusing and require a lot of the reader's time to note the contents.
4. Accuracy:
The report should contain accurate facts and figures. This is because managers base their
decisions on the facts and figures of the reports. If the reports are inaccurate or contain wrong
facts and figures, then it will lead to poor decisions.
5. Reliability:
A good report must be reliable. The information in the report must be collected from reliable
sources. While collecting information for the purpose of preparing reports, care must be taken
to check the validity and genuineness of the source. Reports must not contain outdated data.
If the reports are based on secondary sources, the researcher must check the genuineness of
the data.
6. Objectivity:
The report must be objective. It must contain only objective facts and figures. The reports
must not be biased or subjective. The report must not be influenced by personal bias of the
researcher. Personal bias adversely affects the decision making. Therefore, the research must
be objective especially in the case of commercial research.
7. Logical Arrangement:
The research report must be written in a systematic manner. The different parts of the report must be
arranged in a logical sequence as follows:
Conclusion
Recommendations
Appendix
Bibliography
Certain report findings must be kept secret. E.g. a committee may be set up to look into the
malpractices of a certain manager. The committee members should not leak the findings.
The report findings must be provided strictly to the top management only.
Certain reports must be submitted within a particular time limit. E.g. a committee report must
be submitted within the time limit so that suitable action can be taken.
The report must be written in a suitable format. The report must be divided into paragraphs,
preferably numbered, with a suitable heading for each paragraph. The report must also
contain a suitable title.
Ans: There are various types of Research Report. The types of research report are as follows:
1. Technical Report:
The technical report is written in technical language. It follows a specific pattern and
consists of several sections with proper headings and paragraphs.
2. Popular Report:
It is designed for executives and other non-technical users. The reader is more interested in
knowing the findings of the research, conclusions and recommendations.
While writing this report, certain essentials must be followed, such as conciseness and clarity,
accuracy of data, reliability of data, objectivity and absence of bias, and logical arrangement
of the different parts of the report.
This type of research report is meant for commercial and social research because it is meant
for non-technical people, especially executives in a commercial organization.
3. Interim Report:
When there is a long gap between data collection and presentation of the final report, the study
may lose its importance, and the sponsor may also lose interest in the research and/or
research report. In such a situation, the researcher may present an interim report. The
interim report may also contain the first analysis of the problem and the final analysis of
certain aspects that have been completely analysed. This type of report enables the
sponsoring authority to take decisions without waiting for the full report.
4. Summary Report:
It is generally prepared for the use of general public. This report is desirable for any study
whose findings are of general interest. It is written in non-technical and simple language.
It contains a brief reference to the objective of the research, findings and conclusions. It is a
short report of two or three pages.
5. Research Abstract:
This is a summary of a technical report. It is usually prepared by students of technical fields
such as engineering and medicine on the eve of submitting their thesis. Its copies are sent to
the university, which in turn provides them to the examiners or referees invited to evaluate the thesis.
6. Research Article:
This is designed for publication in a professional journal. If a study has two or more
important aspects that can be discussed independently, it is advisable to write two articles
rather than to include both in a single article.
A research report must be written in concise and clear language. It must be logically
arranged as follows:
Objectives of Research
Recommendations
The Modern Language Association (MLA) establishes values for acknowledging sources
used in a research paper. MLA citation style uses a simple two-part parenthetical
documentation system for citing sources: citations in the text of the paper point to the
alphabetical Works Cited list that appears at the end of the paper. Together, these references
identify and credit the sources used in the paper and allow others to access and retrieve this
material.
In MLA style, writers place references to sources in the paper to briefly identify them and
enable readers to find them in the Works Cited list. These parenthetical references should be
kept as brief and as clear as possible.
The writer needs to give only the information needed to identify a source. Usually the
author's last name and a page reference should suffice.
Place the parenthetical reference as close as possible to its source. Insert the parenthetical
reference where a pause would naturally occur, preferably at the end of a sentence.
Information in the parenthesis should complement, not repeat, information given in the text.
If you include an author's name in a sentence, you do not need to repeat it in your
parenthetical statement.
The parenthetical reference should precede the punctuation mark that concludes the sentence,
clause, or phrase that contains the cited material.
Electronic and online sources are cited just like print resources in parenthetical references. If
an online source lacks page numbers, omit numbers from parenthetical references. If an
online source includes fixed page numbers or section numbering, such as numbering of
paragraphs, cite the relevant numbers.
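These rules can be summarised in a small Python sketch. The helper `mla_parenthetical` and its parameters are hypothetical, and it handles only the basic cases described above (one author, an optional page number, and an author already named in the sentence).

```python
def mla_parenthetical(author_last_name, page=None, author_named_in_text=False):
    """Build a basic MLA parenthetical reference such as "(Lester 46)"."""
    parts = []
    # Do not repeat the author's name if the sentence already mentions it.
    if not author_named_in_text:
        parts.append(author_last_name)
    # Omit the page number when the source has none (e.g. many online sources).
    if page is not None:
        parts.append(str(page))
    return f"({' '.join(parts)})" if parts else ""

print(mla_parenthetical("Lester", 46))
print(mla_parenthetical("Lester", 46, author_named_in_text=True))
print(mla_parenthetical("Lester"))
```

The reference is then placed before the punctuation mark that ends the sentence, clause or phrase containing the cited material.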
The Chicago Manual of Style (CMS) is a style guide for American English published since
1906 by the University of Chicago Press.
It is one of the most widely used and respected style guides in the United States. CMS
deals with aspects of editorial practice, from American English grammar and usage to document
preparation.
The Chicago Manual of Style includes chapters relevant to the publishers of books and journals.
It is used widely by academic and trade publishers, as well as editors and authors who are
required by those publishers to follow it.
Chicago Style offers writers a choice of several different formats. It invites the mixing of
formats, provided that the result is clear and consistent.
1. Notes and Bibliography
2. Author Date
Choosing between the two often depends on subject matter and the nature of sources cited, as
each system is favoured by different groups of scholars.
The Notes and Bibliography style is preferred by many in the humanities, including those in
literature, history and the arts. This style presents bibliographic information in notes and
often, a bibliography. It accommodates a variety of sources, including esoteric ones less
appropriate to the author date system.
The author date system has long been used by those in the physical, natural and social
sciences. In this system, sources are briefly cited in the text, usually in parentheses, by
author's last name and date of publication. The short citation is amplified in a list of
references, where full bibliographic information is provided.
Ans: A researcher must follow certain ethical norms or guidelines in conducting research
work. The main ethical norms in research are briefly stated as follows:
Ans: There are various issues in research. Some of the ethical issues in research are
connected with the research process. The ethical issues relating to research process are
concerned with research design, sample size, data collection, data processing, data analysis
and interpretation, and so on. However, the ethical issues relating to research process can be
avoided by systematic planning for research and by following ethical norms in conducting the
research.
One of the major ethical issues in research involves PLAGIARISM. It is the presentation of
the work of another person as one's own or without proper acknowledgement.
Plagiarism includes copying of material word for word from books, journals, internet sites,
other researchers' notes, etc. It could also be material that is paraphrased but closely
resembles the original source. Paraphrasing is using your own words to express someone else's
ideas whilst still preserving the main ideas of the original source.
Plagiarism does not refer to words alone; it can also refer to copying images, graphs, tables, and
ideas. Presentation is not limited to written work. It also includes oral presentations,
computer assignments and artistic works. If you translate the work of another person into
French or English and do not cite the source, this is also plagiarism. If you reuse your own
work without the correct citation, this too is plagiarism.
Therefore, one should not copy, paraphrase or translate anything from elsewhere without
stating the source of the original text.
To give your writing credibility. You show that you have gathered ideas from
worthwhile sources.
To help the reader. You enable the reader to go and check and read those sources if
he/she so wishes.
To protect yourself from plagiarism. When you cite all your sources, no one can say
that you stole or copied ideas from someone else.
Sources of Others' Ideas:
Direct quotations:
When you are using someone else's exact words, you need to place quotation marks (" ")
around the words. You also need to be careful not to rephrase or reorganize the words;
otherwise you would be guilty of misrepresenting the author. If you want to leave out part of
the author's sentence, you can use three ellipsis points (...) to show the words which have been
omitted. Directly after the quotation, you should indicate where the information comes from,
using one of the standard methods (such as MLA and APA) to document your source. For
more specifics, refer to the hand-outs on MLA/APA documentation, available in H 662 or AD
103, or go to the Concordia University Libraries Citation Guides.
Paraphrasing:
Many students are unclear about paraphrasing. It is not acceptable to take the original
phrasing and to rearrange a few of the original words in order to produce a paraphrase;
neither is it acceptable to use the same sentence structure but just rephrase a few key words.
Example:
Original: Students frequently overuse direct quotation in taking notes, and as a result they
overuse quotation in the final research paper. Probably only about 10% of your final
manuscript should appear as directly quoted matter. Therefore, you should strive to limit the
amount of exact transcribing of source materials while taking notes.
Acceptable paraphrase: In research papers, students often quote excessively, failing to keep
quoted material down to a desirable level. Since the problem usually originates during note
taking, it is essential to minimize the material recorded verbatim (Lester, 1976).
A plagiarized version: Students often use too many direct quotations when they take notes,
resulting in too many of them in the final research paper. In fact, probably only 10% of the final
copy should consist of directly quoted material. So it is important to limit the amount of
source material copied while taking notes (Lester, 1976).
When you paraphrase, make sure to understand what the original is saying, then close the
book and write the passage in your own words. Also, note that you need to cite a source for a
paraphrase even though you did not quote from the source directly. In the examples above,
the source, Lester, is given after the paraphrase. When you are paraphrasing rather than using
exact words, mentioning the page number in the source parentheses is optional, but check
with your professor as some may prefer you to include it.
To avoid plagiarism in research, one needs to state the source from which the information is
obtained. In the research paper, some or most of the ideas may be the researcher's own.
However, some ideas may have been borrowed from other sources or from the people who
have been interviewed on the subject. Therefore, one needs to correctly state the source from
which the information is obtained.
1. Paraphrase: So you have found information that is perfect for your research paper.
Read it and put it into your own words. Make sure that you do not copy verbatim
more than two words in a row from the text you have found. If you do use more than
two words together, you have to use quotation marks. We will get into quoting
properly soon.
2. Cite: Citing is one of the effective ways to avoid plagiarism. Follow the document
formatting guidelines (e.g. APA, MLA, Chicago, etc.) used by your educational institution or
the institution that issued the research request. This usually entails the addition of
the author(s) and the date of the publication or similar information. Citing is really
that simple. Not citing properly can constitute plagiarism.
3. Quoting: When quoting a source, use the quote exactly the way it appears. No one
wants to be misquoted. Most institutions of higher learning frown on block quotes,
or quotes of 40 words or more. A scholar should be able to effectively paraphrase
most material. This process takes time, but the effort pays off! Quoting must be done
correctly to avoid plagiarism allegations.
4. Citing Quotes: Citing a quote can be different than citing paraphrased material. This
practice usually involves the addition of a page number, or a paragraph number in the
case of web content.
5. Citing Your Own Material: If some of the material you are using for your research
paper was used by you in your current class, a previous one, or anywhere else, you
must cite yourself. Treat the text the same as you would if someone else wrote it. It
may sound odd, but reusing material you have used before is called self-plagiarism,
and it is not acceptable.
6. Referencing: One of the most important ways to avoid plagiarism is including a
reference page or page of works cited at the end of your research paper. Again, this
page must meet the document formatting guidelines used by your educational
institution. This information is very specific and includes the author(s), date of
publication, title, and source. Follow the directions for this page carefully. You will
want to get the references right.
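The "no more than two words in a row" rule from the first point can be checked mechanically. The Python sketch below lists every run of three consecutive words that a draft shares with its source. The function names are hypothetical, and the check is only a rough aid: a draft with no shared runs can still be plagiarism if it follows the source too closely.

```python
import re

def words(text):
    """Split text into lower-case words, ignoring punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())

def copied_runs(source, draft, run_length=3):
    """Return each run of `run_length` consecutive draft words found in the source."""
    source_words = words(source)
    source_runs = {tuple(source_words[i:i + run_length])
                   for i in range(len(source_words) - run_length + 1)}
    draft_words = words(draft)
    return [" ".join(draft_words[i:i + run_length])
            for i in range(len(draft_words) - run_length + 1)
            if tuple(draft_words[i:i + run_length]) in source_runs]

source = ("you should strive to limit the amount of exact transcribing "
          "of source materials while taking notes")
draft = ("it is important to limit the amount of source material copied "
         "while taking notes")
print(copied_runs(source, draft))
```

Applied to the last sentences of the original passage and of the plagiarized version quoted earlier, the check flags "to limit the", "limit the amount", "the amount of" and "while taking notes" as runs that would need rewording or quotation marks.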
Q.13. Explain the use of SPSS Statistics for Data Analysis and Reporting.
Ans: SPSS stands for Statistical Package for the Social Sciences. SPSS Statistics is a software
package used for statistical analysis. Long produced by SPSS Inc., it was acquired by IBM in
2009. The current versions (2014) are officially named IBM SPSS Statistics. Companion products
in the same family are used for survey authoring and deployment (IBM SPSS Data Collection), data
mining (IBM SPSS Modeler), text analytics, and collaboration and deployment (batch and
automated Scoring Services).
The software name stands for Statistical Package for the Social Sciences (SPSS), reflecting
the original market, although the software is now popular in other fields as well, including the
health sciences and marketing.
SPSS is a widely used program for statistical analysis in social science. It is also used by
market researchers, health researchers, survey companies, government, education researchers,
marketing organisations, data miners, and others. The original SPSS manual has been
described as one of sociology's most influential books for allowing ordinary researchers to
do their own statistical analysis. In addition to statistical analysis, data management and data
documentation are features of the base software.
Having collected the data, the first step is to create the data file in SPSS. To do this, open
SPSS and you will see a blank spreadsheet ready for data input.
Example: To create a data file, we can take an example. A closed-ended questionnaire is
developed to understand consumer behaviour relating to readymade garments. The questions
are related to the individual's age, gender, education, occupation, monthly income and brand
preference.
You could start entering data now, but it makes more sense to first define the data so that it is
easier to keep track of where you are as you enter it. Defining the data involves giving each
variable a name and specifying missing values. SPSS offers two views of the data file. You
switch from one to the other by clicking the tabs at the bottom left-hand side of the screen.
Variable View is where you define the variable names and specify any other information you
want to record about the variable.
Naming Variables:
Data are defined variable by variable in Variable View (i.e. one row at a time). To do it, click
on the cell where it says NAME. Type a meaningful name in the cell. In this case, I've named the
first variable "age". The name can be up to eight characters long and must begin with a letter.
Missing values:
Missing values are what they sound like: data points for which you have no score. For
example, some participants might not have turned up for a data collection session but you still
have other data from them that you want to use. So you need to enter the data you do have
whilst taking account of missing data points. With questionnaires, people often fail to
complete one or more items. This may be purely accidental, in which case the data are missing
at random because there is no systematic reason for their omission.
Saving the File:
If you haven't done so already, you should now save the file so that all your work doesn't go
down the pan if your machine crashes! Click on File then Save from the dropdown menus to
get the Save As dialogue box. SPSS data files should be given the .sav file name extension,
which is the default when saving a spreadsheet. That way SPSS will always recognise the file
as a data file. Give the file a meaningful name in the File Name box, choose where you want
to save it from the Save In box, then click on OK to save it. Remember to save your work
periodically as you enter the data.
Now you can enter the data. Switch to Data View by clicking on the tab at the bottom
left-hand side of the screen. Now simply enter each case's scores into the appropriate cells,
with each case taking one row. You can use the arrow keys on the keyboard or the mouse to move
from cell to cell. The value you type won't appear in the cell until you move to another cell,
but it does appear in the box above the spreadsheet. If you have missing scores, type in the
missing value that you assigned to that variable.
The variable "sex" is categorical and needs to be coded to differentiate between males and
females. Any numbers will do; in this case I entered 1 for males and 2 for females. Any such
categorical variable needs to be coded in this way. For example, if you have data from an
experimental study, a code will need to be given to each group; say, 1 for a treatment group
and 2 for a control group.
Labelling Variables:
In addition to naming variables, you can label them. This means assigning a longer and more
descriptive name to the variable, which can make reading the output of any analyses easier.
Labels can be up to 256 characters long and can include spaces.
You'll find that the variable label will appear when you mouse over a variable name in the
column heading in Data View. You can get SPSS to print either variable names, or labels, or
both in any outputs by clicking on Edit then Options from the dropdown menus and then
choosing what format you want from the Output Labels tab.
Labelling Variable Values:
You can even assign labels to variable values. For example, you might want to label the values
for sex so that any output that uses this variable will say "male" and "female" instead of just
giving the numbers 1 and 2. To do this, in Variable View click on the appropriate cell in the
Values column, then on the little grey box that appears there to get the Value Labels dialogue
box. Now type 1 in the Value box, then "male" in the Value Label box, and then click on the
Add button.
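The idea behind variable names, labels, value labels and missing-value codes can be mirrored outside SPSS. The sketch below is plain Python, not SPSS syntax. The dictionary layout, the missing-value codes 99 and 9, and the helper `describe` are hypothetical illustrations of the definitions described above.

```python
# Hypothetical data dictionary mirroring what is defined in Variable View.
VARIABLES = {
    "age": {"label": "Age of respondent in years", "values": {}, "missing": 99},
    "sex": {"label": "Sex of respondent",
            "values": {1: "male", 2: "female"}, "missing": 9},
}

def describe(variable, code):
    """Translate a stored code into its value label; None marks a missing score."""
    spec = VARIABLES[variable]
    if code == spec["missing"]:
        return None
    # Variables without value labels (such as age) are returned unchanged.
    return spec["values"].get(code, code)

# One case (row) as it would be typed into Data View.
case = {"age": 34, "sex": 2}
print({name: describe(name, code) for name, code in case.items()})
```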
Measure/Compute:
The level of measurement used for a variable should be correct; otherwise the analysis may
go wrong. It depends on the Qualitative or Quantitative nature of the variable.
After creating the variables, click Data View to enter data in the cells. You can use the editor
window like a spreadsheet and enter the values accordingly.