Datum
I believe that the concept of data has a relatively precise definition, being used with the same interpretation in several areas. In computing there are for example the terms data structure, database and data mining, in telecommunications data transmission, data package, mobile data, while statistics has the concept of statistical data and several other areas also use this term.
All of these areas use this term with the definition you presented on Wikipedia. It is clear that there are small differences, as an example in some telecommunication books the term given is used as synonymous with digital information, countering data transmission and voice transmission, where the voice is not considered to be given because it is an analogue signal, I mean, the data would have to be digital. But apart from these subtleties the concept of data is the same in most cases.
Information
The term information, on the other hand, certainly does not have a precise definition. Shannon himself, considered the father of information theory, wrote the following when trying to define information:
The word "information" has been Given Different meanings by Various Writers in the general field of information Theory. It is likely that at least a number of These will prove sufficiently Useful in Certain Applications to Deserve further Study and Permanent recognition. It is Hardly to be expected that a single Concept of information would satisfactorily Account for the numerous possible Applications of this general field.
Translation:
For the word "information" was given different meanings by various writers in the general field of information theory. It is likely that at least a number of these approaches will be sufficiently useful in certain applications for permanent recognition. It is hardly to be expected that a single concept of information would satisfactorily explain its numerous possible applications.
As an example of this difference of meanings, a large number of scholars argue that thought is particular to the human being and a computer cannot think. There are also several researchers who argue that a computer can have the same form of intelligence as a human being. In certain contexts information is treated as particular to the human being, that is, if in the future it is discovered that a computer really cannot think, or really can think it would change the concept of information. A final definition of information would depend on issues that we do not yet know the answer to, and therefore is still a developing term.
Definition of information
For the context of this question a very useful definition was described by an Italian philosopher named Luciano Floridi which consists of the so-called general definition of information:
The General Definition of Information (GDI):
Σ is an instance of information, understood as semantic content, if and only if:
- Σ consists of one or more data;
- the data in Σ are well-Formed;
- the well-Formed data in Σ are Meaningful.
Translation
The general definition of information (GDI):
Σ is an instance of information, understood as semantic content, if and only if:
- Σ consists of one or more data;
- the data in Σ are well formed;
- the well-formed data in Σ have meaning.
In this definition it is already very clear the difference between data and information and it fits with the understanding of your teacher. Information does not exist without one or more data, but not all data is information. What makes the data become information is the processing and the meaning.
But meaning is an abstract concept and not yet understood in its entirety. If in a given context only a human being can give meaning to a given, then information exists only in the head of a human being. And this can vary from person to person, if a person does not understand binary numbers, a set of these numbers has no meaning to him, if someone else understands, for him it has meaning.
This is a broader definition, which is used in several areas and probably what your teacher was referring to.
Usefulness
But then again, if you have a program that uses data, your data is there and it does what it has to do, what’s the difference whether it is information or not? This definition of what information has to have meaning does not have much use for computing, as it is very subjective.
But Shannon’s definition is extremely useful. In it the information can be measured, so there is the amount of information. The data is measured in bytes and the information too! So a file can have 8KB of data and 1KB of information. If this file is compressed it can stand for example with 4KB of data and the same 1KB of information. How I can unzip the file and get back to the original 8KB no information has been lost!
In this case the information does not depend on having meaning or an observer.
Your question would not be information x knowledge?
– Marco Souza
@GOKUSSJ4 not only information and data, I even thought about it, because I think knowledge is broader, and I want to focus on the context of Computing ;).
– gato
Ah good understood.
– Marco Souza
I think your question is nice, but it seems a little broad. For example, you included the tag [tag:data structure] in it, and in that context (programming a data structure) it may be that what your teacher said makes sense. But in another scope this is not necessarily true. I will not vote to close, however, because I may be mistaken and there is a canonical answer.
– Luiz Vieira
@Luizvieira I put the tag data structure because it’s the part where the most manipulates the data in its raw form, it could be database but I think that wouldn’t make much sense. The purpose of the question is to bring an answer that clarifies the difference between data and information since all Information Technology, Computing and Software Development is based on Data and Information and any other Information System is part of this concept.
– gato
Precisely. In each of these areas you cited the distinction between data and information may not be the same. I just quoted the tag as an example where I think what your teacher said might make sense. But this is not the case in Information Theory, for example.
– Luiz Vieira
@Luizvieira I think that the theory of information addresses this question :), but I found it interesting to force in two question difference between information and data, and as a given becomes information, the answer to these two questions would be so wide?
– gato
If the question is to understand examples like these from
29
and ofM
, I think the answer of rray is enough. Is that it? It seems to be, considering that you assume that datum is prior to information. But as @Luizvieira said, this is one of the possible definitions, within a restricted field. It is difficult to have a universal definition. The wikipedia citation itself doesn’t help much, it seems to be derived from the English entry - which does not cite any primary source, only a half-plus-or-minus secondary. All this to say that I liked the question, but I don’t know how far to answer :)– bfavaretto
I don’t think Information Theory addresses this. I really don’t remember having this distinction, it was all "information". But if you find a source that says otherwise, maybe it’s material for a response from your own. :)
– Luiz Vieira
@Denercarvalho That article (in english) and that link Wikipedia might help clarify a little.
– stderr
@zekk I’ll take a look thanks
– gato