Raster data models and comparisons with vector

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
Hello friends, and welcome to the fourth lecture. And in the previous lecture I have discussed different types of a vector data, if you recall that I mentioned the data and GIS can be divided in two major components; one or two major types of data; one is spatial data, other one is non- spatial data, we also call them attribute data. Then we discussed different type of vector data,point data, line or polyline data, polygon data. Now, in this lecture we will be discussing about what is basically raster data, and types of raster data as well. So, raster data is having two types; one is the grid type data, another one is the image data. There are no much differences between these two, but still the differences between these two, must be understood very carefully, because otherwise while analysing the data, or handling such data may give you such problems. So, understanding about the intricacies of two different types of raster data, is very important. We will be also comparing, seeing the advantages associated vector data as well as raster data, and this we will be doing little later part of our lecture. So, first the raster data as you know that as a continuous data, vector data as I mentioned is a discrete data. So, there are two different way of representing the real world; one it is in a discrete manner using point line or polygon, may be in the TIN model which we will discuss later, but the topic which we are going to discuss in detail, is the raster, in a continuous fashion we discrete, we represent the real world. Now, in raster what are things which are there, or the key things which are there, it is a uniform grid.You can imagine that if it is a real world scenario, then a uniform grid is lead out on this.This too,the concept of this two is based on sampling theory which is called Senan. Senan was a mathematician and who gave this theory which is called Senan’s sampling theory. What does it mean here in our GIS parlance, is that if you if you represent the real world in a continuous equal size cell grid, then each cell is representing an average value of whatever present on that part of the earth. For example, if you if you are having an agriculture land, within a say ten meter by ten meter area, there might be variation in vegetation or say wheat crop or some crop, but everything is average down; say for example, greenery or may be production or anything, is average down and one single value value is assigned to that cell, that value becomes attribute value of our raster cell, and that is why in case of vector, we can have several attributes or theoretically n number of attributes which each vector entity, but with raster cell or raster pixel, we can have only one single attribute; that is one of the major difference between vector and raster. Apart from that raster is a continuous vector is a discrete data, discontinuous data. As here you can see that data is stored here in a two dimensional matrix, which you are having a set of rows and columns. Now number of rows and number of columns need not to be same, as you know the matrix in mathematics, this is also not compulsory; that means, I can have a rectangular size of my data raster data, or I can have square shape, but apart from these two shapes, I cannot have raster in any other shape. However when we are representing raster for an arbitrary area, and then there are some other concept are applied which we call as no data concept, and by using no data concept on our screen or for our visual, we see arbitrary boundary for raster. In fact, in the system for the outside area it is stored as no data, and for inside that polygon it is stored the real data. So, fundamentally the raster overall shape of the raster will have to be always either rectangle or a square, because it is representing mathematically a two dimensional matrix. Now each cell in case of grid we call as a cell, in case of image we call as pixel, we will see little later all these details. So, each cell will represent one attribute one value it may be in case of remote sensing data, it may be reflection value, it may be emitted value, it may be temperature value, or in normal case may be rainfall value or any other value, but only single value per cell or per pixel, and this cell or pixel becomes the unit of our grid or raster. And another important thing has to remember that the shape of this unit has to be square. So, overall shape of a raster can be square or rectangular; however, shape of a unit has to be always a square. So, that we maintain the uniformity. Now as a though mathematician worked on this, and rather that constraining that each cell each unit has to have the same shape, they came with a new idea that instead of a square we can have triangles, of varied size and shape, and that brought the concept of TIN which we will see little later. So, that a unit of raster is always is a square; overall shape of a raster can be square and rectangular, and each unit of raster will have only one single attribute; not like vector data. So, raster is a continuous, and we are, generally familiar when we are handling the image raster,; then there are image formats like tiff gpeg and gif and different image processing software, are having their own format; like erdas is having its own image format which is very popular one is IMG format. So, there are you know there the data this is how it is represented and then the cells are having. Now the size of a cell can vary, it is not always one meter one by one meter or two meter by two meter, but the shape is fixed which is square, but the size of the cell can change larger the area, the cell is covering of a raster. We say a smaller the spatial resolution, or poorer or courser the spatial resolution. Whereas a cell if it representing a very small area of the ground, then we call as a higher spatial resolution. For example, in a satellite image a one pixel might be representing ten meter by ten meter, and we if we compare a another satellite image which might be representing, a pixel might be representing an area of one meter by one meter; that means, the image which is having one meter cell or one meter pixels, is having higher spatial, relatively higher spatial resolution. Spatial resolution term is really relative, for someone, for some applications hundred spatial resolution may be useful, but for another person even one meter spatial resolution may not be useful. So, we have to be thinking in this direction that is a relative term, as per our requirements we call as a higher spatial resolution, or we call. About the spatial resolution how a different spatial resolution will bring the changes in our image which we will see little later in this in this presentation. Now not only the satellite images which are handled in GIS, but there are other raster data, like a digital elevation model which is which is representing the undulation, or topography of a terrain, or part of the earth; that means, where are the higher grounds, lower grounds, valleys, hills, slopes and all those things, can also be represented in a form of grid or in raster and each cell of that grid will be representing elevation value that is why it is called digital elevation model. Example is given here, and here that attribute of that cell of that raster or grid going to be your elevation value. Whereas in case of image it might be having reflection value, or emitted value, or temperature value, or any other value is represented there, but in satellite images since this is coloured image; therefore, it is constructed using three primary colours, and therefore, you are seeing in colour, but still all are raster, and different type of raster which is a your image raster. So, as I have already mentioned earlier that there are both, you know both are image and grid both are raster model, however there are differences in case of image, the unit we refer as pixel, pixel in abbrevation, which is pixel element, and a the cell in case of grid the unit we call as cell. This makes a while you know calling differently and also gives gives us advantage of understanding, which is required for the next property, or next characteristics, which is the value of the image pixel or value of the cell. In case of image the value of a pixel can only be positive integer value. We cannot have no other value and; that is one another big difference between an image and a grid. Whereas, in case of grid the cell value; that means, the unit value, can be either negative or positive integer or real numbers. So, variety of values can go in case of grid data, but in case of image data only positive integer values. So, these two major differences are there, when we say pixel; I am that means one is talking about image and once a person is talking about an image that the unit value of that image; that means, the pixel value has to be a positive integer value. And when I say cell value; that means, I am talking about grid, and the cell value can be either positive negative integers or real values. So, the grid is having more the having more capabilities or bringing or handling all types of all types of integer and real numbers; whereas, the image can only handle the positive integer number. So, this difference has to be remembered, because if we don’t a take care about this, then because the format for grid are different, then format for image. Image formats we are familiar because now a days we use these digital cameras, and which are also inbuilt in mobiles most of the time we are storing images in JPEG, but we can change the format, we can store the image, and all are these image formats and; that means, we are storing pixel values as positive integer values. If you get sometime you can open some images, zoom it and read the values, you would always find a positive integer value. Now, the next thing is which I test little bit about spatial resolution, which is very very important to understand. Because a this is related, though it’s a related term, it plays a very important role in our day to day life as well as in GIS system. And as we go by the definition is a major of how closely pixels can be resolved in an image; that is called spatial resolution, and it depends on the properties of the system creating the image. Now if a what does it mean that if there are two neighbouring pixels are there, and if they are having very you know a contrasting properties, they might appear in a particular scale, they might appear separately. So, if you take a digital image, may be taken by your mobile camera putted in computer zoom it to a very high level, after sometimes you might realize where the two contrasting things are there, you might realise, two pixel or neighbouring pixels are showing different colours or states. So, when you reduce the scale, or zoom out you know then what happens that these starts resolving, and then individual boundaries of the pixels you will not see. Now, in a in our domain that is in GIS or in remote sensing, this is what we say the clarity of an image, which is decided by the spatial resolution; that means, a an image which is having very high spatial resolution, should be looking much pleasing to eyes; that means, might be that it is having higher spatial resolution; that means, the number of rows in order to cover the same area, number of rows and columns are going to be more, and; that means, the ground area which that high resolution image is representing; that means, one pixel of that image is representing is very very small, comparing to image which is having relatively low resolution. So, sometimes we like in digital mobile cameras we say; ten mega pixel, twelve mega pixel, eighteen mega pixel; that means, we are talking about the dimension of the image. So, larger the mega pixel, the clarity of the image would be higher, higher the spatial resolution. The example is given here, that here the spatial resolution is 110 by 110 kilometre and you are unable to resolve adjacent pixels; and therefore, you are seeing the boundaries of pixels it is a relatively very course image, but when you make this representation is a using a finer grid, and the having a resolution of thirty kilometre by thirty kilometre cells, then at least a some clarity has been introduced in this middle stage, but if a if a you improve further, and then a individual boundaries of two neighbouring pixels are resolved then you will start seeing a much more realistic image; and that is relatively having much higher resolution image then the right most image. Same thing here that, this is one by one; that means, there is nothing no differentiation is there one pixel only. Whereas, here two by two; that means, there are total four pixels, all are having different colours, but we cannot make out what it is written there, or what is what object is within these four pixels, but once we start moving and you know segmenting this area into finer size pixels an image start appearing. Here in 10 by 10 it appears may be r may be a alphabet, but here it starts appearing as a r much clearly, but by 50 by 50 and 100 and 100 becomes much sharper. So, the right most here, is having very high resolution, relatively as compared to the first two three four or five. So, this is this is the major difference which resolution, especially spatial resolution causes in our representation of raster data, or that might be a grid representing a digital elevation model, or might be a satellite image, or photograph taken by your digital camera. Now as I have already mentioned that vector is a discrete raster is a continuous model. So, now, we will try to compare and see what advantages are associated with vector data, and what other advantages are associated with raster data, that these data sets are fundamentally completely different; vector and raster. And if a real world has to be represented in these two formats then you can see that if real world is represented, which is a very simple world of a real world that you are having a stream, you are having a say agricultural land in the background and forest and a house. If I have to represent that one in the vector, then the house is going an area here, depending on the scale in a smaller scale that house might appear as a point data, but here as a polygon, and same with the forest is a polygon. Whereas, a stream though in this state image, it is looking that the stream is occupying an area, but here I am representing as a line feature; that means, only length no area. And if I have to represent the same real world into raster, then house will have this much size; one pixel is representing house, few pixels are representing forest, and then few pixels or cells are representing steam or the river here. Now you will realize that I can increase number of nodes in case of vector, and can make my line very smooth, but in order to make my line smooth, or a steam representation much more smooth, then I need to increase the spatial resolution; that means, the representation through my raster has to be with very small cells, and that means, I am going to occupy more space on my computer, if and more handling or processing time, if I go for higher and higher spatial resolution. So, it depends on the your project requirements, Nowadays all types of all types means, all variety of spatial resolution data are available.For example, if I take digital elevation model, and I take the example of freely available digital elevation models, then starting from one kilometre spatial resolution digital elevation model to up to 30 meter spatial resolution digital elevation models are available. Now but what is the purpose if you are going for a small are area and you are going in much more detailed analysis, then it is always better to go for higher spatial resolution digital elevation model. But if you are covering a very large area a large region or a continent, then it is not worth going for a high spatial resolution, it is better to go for relatively courser of spatial resolution may be ninety meter, may be one kilometre resolution. So, it depends on the requirements of your project and what kind of products you are going to generate after all analysis in GIS. So, variety of spatial resolution, digital elevation model, same with the satellite images nowadays satellite images of 30 centimetre resolutions are available, and satellite images of one kilometre resolutions are also available. So, again if you are looking for a if you are doing analysis on continental scale, then you need not to go for thirty centimetres, because the data handling is going to be very use, and you may not be able to handle data on a one single machine, using standard stand alone GIS software, having a very high spatial resolutions data for a continental areas. So, it is always find out what is the best and then bring in your GIS software and do the analysis. All types of spatial resolution data nowadays are available. Now vector data models as we know that it keeps the data either point line or polygon, and which is a vector data for certain types of features is good, which are having discrete, but a certain types of things which are varying continuously, like for an example given a soil, may not be good to represent through the vector data, because soil varies in nature a very gradually, not suddenly, but if somebody is using for lithological maps then we know, that lithology changes suddenly sometimes, and therefore, may be vector may be useful in that way. So, raster whereas, raster evolved is a continuous data, continuous features, both models are having advantages and disadvantages which we will see very soon, and the modern GIS are able to handle both types of data model, or rather three types of data model vector raster and TIN. Another important thing is that a GIS are nowadays capable of transforming, or converting your raster to vector and vector to raster; however, with some limitations. So, little later we will be also seeing, how we can convert from raster to vector, and vector to raster. So, if we start looking the advantages first with the vector raster data. This is the most common and very simple to understand raster data easy to perform, because mathematically it’s just simply a two dimensional matrix, and therefore, whatever the analysis tools which have been developed in matrix domain of mathematics, all can be implemented easily in GIS. Secondly, all satellite images are also raster, and there are many functionalities, algorithms and every capabilities have been developed, in digital image processing software’s, and therefore, the same thing have also been implemented into GIS. So, raster, because concept wise it is simple, and therefore, lot of tools are available to analyse the raster data. I have already said about the satellite data, because the satellite data becomes one of the very important data input into your GIS, because it is easily available, it is a continuous data and with almost you know every day you can have some satellite data, or another. Whereas, the advantages is vector, the biggest advantage with vector is accurate, because even for a house which might be in a small scale map, might be represented as a point, but in raster it will have an cell; that means, area. So, even a point will have an area whereas, as per the definition in vector point is a zero dimensional entity. So, raster is not that accurate as vector, positional accuracy is very high in case of vector data, and it is compact because it is discrete. There is no redundancy in the data. In case of raster you are having lot of redundancy, and then therefore, there are data compression tools are available, to remove that redundancy, or reduce the size of for storage of raster data, but for vector data there is no redundancy. Once the topology has been built then redundancy is completely gone, and no compression other things are required. So, vector data is very compact, and large area data in vector format can be stored on a small computer space, and can we have has another advantage that in raster you can have only one attribute, but in case of vector you can have multiple. So, that is another advantage. As we know that raster which we have already covered this part, the raster divides into grid cells vector uses point line polygon, units in case of vectors are point line polygons, in case of raster, either cell or pixel. And this processing can be done with grid and convert it from vector to raster, and all modern GIS software’s supports all three types of data handling. Raster has we go finer and finer in the grid cell or pixel size, we go for higher and higher resolution, and later on also, the size can be changed, but resolution spatial resolution cannot be real, in real sense cannot be improved, but on computer point of view you can create more number of rows and columns and therefore, a smaller size cells are there. Now, let me take an example of a soil map. As I have said that soil map, soil is a continuous varying feature, but sometime we have to represent as a polygon map. So, example here is different types of soil map is represented in a polygon map, but if the same soil map, if I have to represent in a raster, then I have to represent something like that. So, both vector and raster representations are there. And what you will see the boundary line of two soil units in case of raster, you are seeing a stair steps case, or also called edge effect of the boundaries; that means, the boundaries are not sharp, and as mentioned earlier in case of vector, the positional accuracy is very high, relatively very high as compared to raster which you can also realize. So, on the edges of two different units, the boundary will have this edge effect or stair steps case, something like stairs which are there, but if you improve the spatial resolution that if you go for final spatial resolution, then it might not be that much obvious, but anyway at every state of raster, at the boundary then it is going arbitrary then you will have this edge effect. So, that is one of the disadvantages with raster. So, finally, what I am going to do. I am going to run through all these things with the raster versus vector, using different characteristics one by one, and see that what are the advantages with vector and raster. First we characteristic which we take is the data structure. By now you must have realized that the vector, because of topology and different types of vectors, relatively vector data is little complex than raster, because raster it is easy to understand, it is easy to handle in computers. So, raster is very simple data structure compared to vector data structure, and therefore, anything which is simple is much easier to learn. So, the ease of learning is also accordingly, that a vector is having difficult. Software’s are complex and expensive, whereas, raster one are much simpler. Positional precision as discussed already that a vector provides very high precisions, and when your data representation is very accurate, precision wise, accuracy wise then you will have a better outputs. In case of raster the output also becomes little difficult. So, precision increase with increase processing time, and data storage, also precision here will also increase with increase of a spatial resolution. Now, attribute precision it is very good in case of vector, and you can have n number of attributes associated with each vector entity point line polygon, but in case of raster it is limited only to one entity, or one attribute value. Compressiveness of analysis capability, it is good for a special query with some limitations, when it comes for intersections, especially when you are overlaying two polygons together. We will say some examples of such limitations. How these have been resolved in GIS that we will also see, but still it is limited in GIS. Whereas, in case of raster; no issue, no problem, because these raster will stack one of another one thing, and if they are belongs to the same coordinate system and same area, then even the spatial resolution is different, does not matter they will stack, and you will not find any difficulty while analysing such raster data layers. The another characteristics which is overlaying ability with vector is good, but after sometime you start seeing splinters; that means, lot of polygon will come after overlaying four five polygon layers, and interpretation or uses of such maps becomes very difficult, but in case of raster that problem will never arise. Now storage requirements as we have already realized that vector require little less storage. it is a discrete data it does not require much storage, and therefore, though it is complex, but it is easy it requires less space. Whereas, raster is simple, but it requires more space, because of redundancy in the data, and when we will discuss the raster data compression techniques then we will go in detail, how what basically redundancy affects the raster data, ability to work with the image data. It is a not as good because image data is raster and, when you are working with the image data and vector data you will have problems. whereas, if you are having a image data image data is also a raster, your grid data is also raster, no problem whatsever. Conversions to other map projections map projection we will discuss in another lecture. So, when we go for different map projections, vector is very good, and most of the GIS modern GIS software’s include change of projection from one projection to another. And if you are having data in vector format it is very easy to convert from one format to another, one projection to another, but with raster data it is rather difficult, and it becomes, note the accuracy part will also come in between. Now ability to work with the network data structure, because vector data why I say, because line data I gave the example that most of this networks, like power grid or sewage line or telephone lines, ofc lines all are networks, and they are stored in computer or in GIS software’s as vector data, and therefore, if you are working in that domain then it is very good with the vector data, but it is with the raster; obviously, it is very very poor. Cost expensive vector is expensive, because each and every node has to be digitized; either wise some semiautomatic method, because yet there is no fully automatic method to convert vector data from analogue to digital. So, maximum the best possible methods are available, semiautomatic methods with the human interventions, lot of human intervention, and therefore, and the vector data is expensive. Whereas,raster data is much easier, and that is why lot of free digital elevation models which are in grid format, which is raster free satellite images are available on the net now a days. So, it is not that expensive, because in one go a satellite can cover a large area, but it is not possible with vector data, and also the output map quality. Here now vector is having advantage very good and looks like a map; whereas, your raster are poor and does not look like a map. Another problem with a raster data, vector, because a laymen or normal people, those who do not know much about GIS or a such a fields, they are familiar with the vector data maps. They have being seeing since very long, maps which are having vector data, but people are not very good on seeing or interpreting satellite images or raster data. So, that is that is why many times though we may be doing lot of analysis using raster data, but in order to make it is much simpler we might be converting finally, into vector. So, that our map quality becomes much much good or better, and looks likes conventional map; otherwise a raster data, those who understand raster data for them there is no issue. Now, I want to cover this about the overlay ability, that I mentioned here that vector is having vector is having limited overlay. So, what does it mean here? That if I take example of two maps both maps are having three units each, so here there are three units a b c, and in land use map there are three units one two three.It’s a very simplest example in real GIS operations this kind of simple operations are not perform many complex operations, and may be that you might be having node three units, might be 30 units in one map or 60 units in one map, but anyway in this example, which is very simple example, attribute one set of attribute which is for a b c, the depth of soil is given here. Whereas, in case of land use which code are or id’s are for different polygons; are one two three, and what kind of land use in different unit of polygons are used for as agriculture and pasture. When we overlay this one on another, using like a transparent sheet, using your set theory concept or Booleans logic, then this kind of intersection map will have seven units now in this particular example. So, we started with three units, and we ended up with the seven units, seven polygons. Some have become very small, and imagine that you are having thirty units in one map and say thirty or forty units, or polygons in another map and such overly operations then you perform then you will end up with hundreds of such polygon, and therefore, the interpretation of such maps becomes difficult. In earlier versions of GIS even overlay of two vector polygons was not very easy, but nowadays it is not that difficult. So, one more field has been added; that means, one more attribute has been added, and initially each has single attribute now it is one more,and this id have also changed. So, you are having soil and land use. So, a 2 a2 is having part of a and soil map and the second unit two of the land use map, and likewise you can have overlay even in vector data, to certain limits. But if you overlay many vector polygon vector maps and then you will end up with very very small small polygons, splinters, and the interpretation of such maps or utility of such maps becomes very difficult. Now I also mention that it is, now a days it is possible to convert from vector to raster and vice versa; however, with some limitations and whenever you convert from one format to another, one model to another you will introduce some errors, but only when it is most essential or compulsory, then convert from one format to another; otherwise originally if it is available in a raster format, keep it as a raster. If it is available in vector format keep it as a vector, because once you convert from one format or one model to another, you are bound to introduce errors, errors are inevitable, and as mentioned in previous lectures that error propagates in GIS. So, our aim should be to keep the error at their minimum level anyway. So, if I want to convert from vector to raster what I need to determine, I need have the row number and column number; and that is what here is, L and E ,L is the line number E is the element; that means, the column number, and these are the two functions F1 and F2 which are polynomial functions. So, again mathematical concepts are coming here again; polynomial equations are used not only in the conversions, later on extensively polynomial equations are used in geo referencing as well. So, when we will discuss we will go through different orders of polynomial equations, there also. And. So, here the input are x and y from your vector data which are your coordinates, and using polynomial functions you will get line number and row number. Similarly in case of raster to vector conversion which is just reverse, that we need to determine these two coordinates appear x and y. So, again there are polynomial equations, and l and e; that means, line number and row number will go as input, and this is what it is, the line is the scan line or line number e is the element or pixel, x is the horizontal coordinate in the map projection, y is the bi-coordinate or vertical coordinate, and F1 to F4 are polynomial functions. So, this is how the conversion from one model to another model, is possible in GIS, but as I have mentioned, any conversion is not fully transparent; that means, it is it is going to bring some errors, because you know that now you know that vector and raster data models are two different contrasting data models, and therefore, you are bound to have problems when you convert from one to another format,and this brings to the end of this my presentation. Thank you very much.
Info
Channel: Introduction to Geographic Information Systems
Views: 29,474
Rating: undefined out of 5
Keywords: Raster, data, models, and, comparisons, with, vector
Id: Dwho7aF4yHo
Channel Id: undefined
Length: 42min 25sec (2545 seconds)
Published: Sat Sep 03 2016
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.