Page 179 - Data Science class 11
Here, we perform an analysis on pizzas. The study is based on the upvotes that pizzas in different pizza crust categories
have received. A similar kind of table can also appear on a web page on the internet. The data shown in these tables
can be stored on the internet in an XML document, as follows:
<?xml version="1.0"?>
<Pizzas>
  <Pizza>
    <PizzaTopping> pepperoni </PizzaTopping>
    <PeopleChoice> 64% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> sausage </PizzaTopping>
    <PeopleChoice> 56% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> mushrooms </PizzaTopping>
    <PeopleChoice> 54% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> extra cheese </PizzaTopping>
    <PeopleChoice> 52% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> onions </PizzaTopping>
    <PeopleChoice> 48% </PeopleChoice>
  </Pizza>
</Pizzas>
4.8.3 XML parsing
Parsing XML refers to going through an XML document to access or modify its data. An XML parser
provides a way to read and change the data present in an XML document.
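As an illustration, the pizza document shown earlier can be parsed with the `xml.etree.ElementTree` module from Python's standard library. This is only a minimal sketch: the helper name `read_choices` and the new value of 50% for onions are assumptions made for the example and do not come from the survey.

```python
import xml.etree.ElementTree as ET

# The pizza survey data, copied from the XML document in this section.
PIZZA_XML = """<?xml version="1.0"?>
<Pizzas>
  <Pizza>
    <PizzaTopping> pepperoni </PizzaTopping>
    <PeopleChoice> 64% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> sausage </PizzaTopping>
    <PeopleChoice> 56% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> mushrooms </PizzaTopping>
    <PeopleChoice> 54% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> extra cheese </PizzaTopping>
    <PeopleChoice> 52% </PeopleChoice>
  </Pizza>
  <Pizza>
    <PizzaTopping> onions </PizzaTopping>
    <PeopleChoice> 48% </PeopleChoice>
  </Pizza>
</Pizzas>"""


def read_choices(root):
    """Access data: map each topping to its percentage as an integer.

    `read_choices` is a hypothetical helper written for this example.
    """
    choices = {}
    for pizza in root.findall("Pizza"):
        # Element text keeps the surrounding spaces, so strip them.
        topping = pizza.findtext("PizzaTopping").strip()
        percent = pizza.findtext("PeopleChoice").strip().rstrip("%")
        choices[topping] = int(percent)
    return choices


# Parse the text into a tree of elements; `root` is the <Pizzas> element.
root = ET.fromstring(PIZZA_XML)
choices = read_choices(root)

# Modify data: change the stored value for onions (50% is a made-up value).
for pizza in root.findall("Pizza"):
    if pizza.findtext("PizzaTopping").strip() == "onions":
        pizza.find("PeopleChoice").text = " 50% "

updated = read_choices(root)
print(choices)
print(updated)
```

The first call reads the data without changing it, while the loop that follows shows that the same parsed tree can also be used to modify a value in the document.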
Recap
• In this chapter, we learnt about the use of surveys to collect data. We also understood how to design the questions in a
survey and the different types of questions that may find their place in a survey.
• Introducing bias while collecting a sample will give incorrect results.
• Sometimes, results from an experiment are stated as an approximation. The range around the approximated value
within which the actual value is expected to lie is the confidence interval.
• Data can be collected via different mediums from different places. We can collect temperature data via a thermometer,
while data on the internet can be collected via XML.
• A sensor is a device that responds to a physical stimulus and transmits a resulting impulse, for example to automatically
controlled actuators. Sensors respond to changes in heat, light, sound, pressure, magnetism or motion.
• XML stands for eXtensible Markup Language. It is used as a text-based data format for the exchange of structured data. It
is a markup language that provides rules to define any data.
• Assumption of independence means that no data point in a dataset is influenced by any other data point. In other
words, one data point does not depend on the outcome or value of another data point.
• A one-tailed test is a statistical hypothesis test set up to show that the sample mean is either higher or lower than
the population mean, but not both.
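
The one-tailed idea in the last point can be sketched with a short calculation. The example below assumes a z-test, which applies when the population standard deviation is known; the chapter does not prescribe this particular test. The function name `one_tailed_z_test` and all the numbers are made up for illustration.

```python
from math import sqrt
from statistics import NormalDist


def one_tailed_z_test(sample_mean, population_mean, population_sd, n,
                      direction="greater"):
    """Return (z, p_value) for a one-tailed z-test.

    Hypothetical helper for illustration. `direction` is "greater" to test
    whether the sample mean is higher than the population mean, or "less"
    to test whether it is lower. Only one direction is tested at a time.
    """
    # Standard error of the mean for a sample of size n.
    standard_error = population_sd / sqrt(n)
    z = (sample_mean - population_mean) / standard_error
    # Area under the standard normal curve to the left of z.
    left_area = NormalDist().cdf(z)
    p_value = 1 - left_area if direction == "greater" else left_area
    return z, p_value


# Made-up numbers: population mean 50, standard deviation 10, and a
# sample of 100 values whose mean is 52.
z, p = one_tailed_z_test(sample_mean=52.0, population_mean=50.0,
                         population_sd=10.0, n=100)
print(f"z = {z:.2f}, one-tailed p-value = {p:.4f}")
```

A small p-value here would support the claim that the sample mean is higher than the population mean. Because the test is one-tailed, it says nothing about the opposite direction.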

