Comments on: Data data everywhere but not a scrap of sense http://gov2.net.au/blog/2009/11/16/data-data-everywhere-but-not-a-scrap-of-sense/ Design by Ben Crothers of Catch Media Wed, 28 Apr 2010 12:51:50 +1000 http://wordpress.org/?v=2.8.6 hourly 1 By: Cameron Shorter http://gov2.net.au/blog/2009/11/16/data-data-everywhere-but-not-a-scrap-of-sense/comment-page-1/#comment-4319 Cameron Shorter Tue, 17 Nov 2009 19:17:26 +0000 http://gov2.net.au/?p=1345#comment-4319 Yes, people have been talking about the semantic web for 10 years now, but it has not eventuated in force yet because it is elusively difficult. However the tools and processes are now becoming available to set up a community and standards driven domain. This has been spear headed by Australia's CSIRO in the Geospatial domain, and our Australian company have also been involved. I've blogged about the process at: <a href="http://cameronshorter.blogspot.com/2008/10/community-schemas-making-sense-out-of.html" rel="nofollow">http://cameronshorter.blogspot.com/2008/10/community-schemas-making-sense-out-of.html</a>. Cameron Shorter GeoSpatial Solutions Manager <a href="http://lisasoft.com" rel="nofollow">LISAsoft</a> Yes, people have been talking about the semantic web for 10 years now, but it has not eventuated in force yet because it is elusively difficult. However the tools and processes are now becoming available to set up a community and standards driven domain.

This has been spear headed by Australia’s CSIRO in the Geospatial domain, and our Australian company have also been involved. I’ve blogged about the process at: http://cameronshorter.blogspot.com/2008/10/community-schemas-making-sense-out-of.html.

Cameron Shorter
GeoSpatial Solutions Manager
LISAsoft

]]>
By: Jimi Bostock http://gov2.net.au/blog/2009/11/16/data-data-everywhere-but-not-a-scrap-of-sense/comment-page-1/#comment-4252 Jimi Bostock Mon, 16 Nov 2009 13:46:07 +0000 http://gov2.net.au/?p=1345#comment-4252 A great set of posts that is well punctuated by Kerry and Jo. They are extremely on the scent with the allusion to narrative and story. It is a thought path that will challenge many involved in the brave new world of government. It is going to be hard for people without narrative background and experience to start to think in the ways that Jo and Kerry are alluding to. I often like to think about the sorts of jobs that I can imagine springing up as we move forward in the digital revolution. I wonder about these 'crafters of knowledge' that I like to call them. People who's job it is to apply meaning to digital objects. They are going to be strange creatures, ones that we have not yet met. In organisations, a question will start to come up on what 'part' of the org is going to be 'in charge'. Is it the IT department, the library, communications, or is the answer all of the above. Anyone on this forum knows the answer, it's all about convergence. Building on what Jo well argues, these new knowledge and meaning crafters will be entirely useless without the key ingrediant - the subject expert. I have been often accussed of being just a techie (despite my zero technical skills) but you can be assured I am well aware of the crucial and symbiotic links between the mashers and the subject experts. So, not having been at GovHack, I would still say that if the one thing that came from it was that we needed to get the people who can make a story from the 'data' in the same room as the wizards, then that is an outstanding, dare I say historical, result. A great set of posts that is well punctuated by Kerry and Jo.

They are extremely on the scent with the allusion to narrative and story. It is a thought path that will challenge many involved in the brave new world of government. It is going to be hard for people without narrative background and experience to start to think in the ways that Jo and Kerry are alluding to.

I often like to think about the sorts of jobs that I can imagine springing up as we move forward in the digital revolution. I wonder about these ‘crafters of knowledge’ that I like to call them. People who’s job it is to apply meaning to digital objects. They are going to be strange creatures, ones that we have not yet met.

In organisations, a question will start to come up on what ‘part’ of the org is going to be ‘in charge’. Is it the IT department, the library, communications, or is the answer all of the above. Anyone on this forum knows the answer, it’s all about convergence.

Building on what Jo well argues, these new knowledge and meaning crafters will be entirely useless without the key ingrediant – the subject expert.

I have been often accussed of being just a techie (despite my zero technical skills) but you can be assured I am well aware of the crucial and symbiotic links between the mashers and the subject experts.

So, not having been at GovHack, I would still say that if the one thing that came from it was that we needed to get the people who can make a story from the ‘data’ in the same room as the wizards, then that is an outstanding, dare I say historical, result.

]]>
By: Jo Deeker http://gov2.net.au/blog/2009/11/16/data-data-everywhere-but-not-a-scrap-of-sense/comment-page-1/#comment-4240 Jo Deeker Mon, 16 Nov 2009 04:57:28 +0000 http://gov2.net.au/?p=1345#comment-4240 I'm sure there will be a wash-up (if not I suggest we have a retrospective) now that entries to the Mashup Australia competition have closed. At the last minute as small group of my colleagues entered the competition - we always planned to, but we did our project and client work first and left it till the last minute to get a competition entry in. My reflections on the exercise are that while we have great developers, prototypes and even mature applications available to enable mashups, in the end it's all about the data. <strong>You need to have people involved who are thinking about the data and what meaning they can generate from the data</strong> I think the most powerful combination of people you can have when mashing data is a developer to make the tools, hook into the APIs etc, and present it powerfully, and an analyst or statistician who can help tell a story with the data, or work out how it usefully relates to other things. I observed that the focus of the mashup comp has been towards developers which I thought was interesting. This has maybe resulted from the "just make APIs, don't make applications" opinion that is one opinion out there. If you have APIs, and then you have to make your own applications, what do you get? 50 NSW crime maps all using slightly different flavours of the same technology. <strong>Mashing up data requires something to join on</strong> The most common & interesting data types to join on are spatial or time based. I know there are others. I wanted to work with the DEEWR university stats data but didn't find anything I could join it to. <strong>Mappable data is more fun to work with</strong> Much of the data didn't have spatial characteristics which is a real shame because I bet it's stored somewhere. Even postcode data can be useful (and I know the can of worms I just opened!). <strong>Most of the data needed a lot of work to get it into a mashable format</strong> I also started to work with the BITRE airline data. It required a lot of formatting to get it into a format acceptable by any program. I ended up abandoning it, again, because I couldn't find anything interesting to join it with readily available. There was data I could have used from ABS but it would have required too much reformatting. My favourite format is what the ABS call 'csv' string - a CSV file with categories and then results. It can be easily aggregated and transformed to go into a variety of applications and fits the world I am used to which is relational. However, there are new data sharing formats that are also worthy of consideration such as MDX and SDMX. <strong>Metadata was missing</strong> The dataset I worked with, NSW crime data, had cells that were clearly annotated in the past, but I don't know what the annotations were. I also found an unexplained peak in data that I don't have an explanation for. This is where formats like SDMX could be interesting because they store metadata too. Even the LGAs- I didn't know which year the data related to. I used the ABS 2006 boundaries and they matched (but the 2004 boundaries didn't) so I assume it's a valid match. All up, I really enjoyed participating in the mashup competition, and some of the entries are really really good. I think ours was solid, but with more work on the data, and more time, I would have had a much more compelling story to tell with the data. And I know that any analyst or statistician already knows this! cheers Jo I’m sure there will be a wash-up (if not I suggest we have a retrospective) now that entries to the Mashup Australia competition have closed. At the last minute as small group of my colleagues entered the competition – we always planned to, but we did our project and client work first and left it till the last minute to get a competition entry in. My reflections on the exercise are that while we have great developers, prototypes and even mature applications available to enable mashups, in the end it’s all about the data.

You need to have people involved who are thinking about the data and what meaning they can generate from the data
I think the most powerful combination of people you can have when mashing data is a developer to make the tools, hook into the APIs etc, and present it powerfully, and an analyst or statistician who can help tell a story with the data, or work out how it usefully relates to other things. I observed that the focus of the mashup comp has been towards developers which I thought was interesting. This has maybe resulted from the “just make APIs, don’t make applications” opinion that is one opinion out there. If you have APIs, and then you have to make your own applications, what do you get? 50 NSW crime maps all using slightly different flavours of the same technology.

Mashing up data requires something to join on
The most common & interesting data types to join on are spatial or time based. I know there are others. I wanted to work with the DEEWR university stats data but didn’t find anything I could join it to.

Mappable data is more fun to work with
Much of the data didn’t have spatial characteristics which is a real shame because I bet it’s stored somewhere. Even postcode data can be useful (and I know the can of worms I just opened!).

Most of the data needed a lot of work to get it into a mashable format
I also started to work with the BITRE airline data. It required a lot of formatting to get it into a format acceptable by any program. I ended up abandoning it, again, because I couldn’t find anything interesting to join it with readily available. There was data I could have used from ABS but it would have required too much reformatting.
My favourite format is what the ABS call ‘csv’ string – a CSV file with categories and then results. It can be easily aggregated and transformed to go into a variety of applications and fits the world I am used to which is relational. However, there are new data sharing formats that are also worthy of consideration such as MDX and SDMX.

Metadata was missing
The dataset I worked with, NSW crime data, had cells that were clearly annotated in the past, but I don’t know what the annotations were. I also found an unexplained peak in data that I don’t have an explanation for. This is where formats like SDMX could be interesting because they store metadata too.
Even the LGAs- I didn’t know which year the data related to. I used the ABS 2006 boundaries and they matched (but the 2004 boundaries didn’t) so I assume it’s a valid match.

All up, I really enjoyed participating in the mashup competition, and some of the entries are really really good. I think ours was solid, but with more work on the data, and more time, I would have had a much more compelling story to tell with the data. And I know that any analyst or statistician already knows this!

cheers

Jo

]]>
By: simonfj http://gov2.net.au/blog/2009/11/16/data-data-everywhere-but-not-a-scrap-of-sense/comment-page-1/#comment-4239 simonfj Mon, 16 Nov 2009 04:49:47 +0000 http://gov2.net.au/?p=1345#comment-4239 Thanks Pip, Kerry, Can i get a bit philosophic, cause we need to be. You make the aim of what we are attempting here quite clearly. <blockquote>the agenda of Gov 2.0, and of the whole project of providing transparency and openness in government</blockquote> I've left off the last word <em>data</em>, as I think it throws everyone of the scent of a <em>common sense</em>. <strong>You</strong> might believe that <em>"utilising the power of semantic technologies is going to play a big part in allowing us to address this issue"</em>. I don't. To me, what Nic and Pia have done is far more important. They've used various technologies to improve <em>digital engagement</em>. (and I've said all i have to say <a href="http://blogs.cabinetoffice.gov.uk/digitalengagement/post/2009/10/23/International-links.aspx" rel="nofollow">by doing some introductions</a>. It's hard, I admit, as we move from an age when we expected our .gov (and .edu) institutions to just <em>'deliver a service'</em> to one where we can compile many of services ourselves; if only we had the access rights into a few publicly funded databases. As <a href="http://gov2.net.au/blog/2009/11/11/blank-piece-of-paper-2/#comment-4117" rel="nofollow">I read Kevin's latest entry</a>, I can see it's not just me that simply wants access, after which government can just <em>get out of the way</em>. I've tried to raise a discussion about AGOSP being a <em>way in</em> as well as a <em>way out</em> (of .gov databases) but I'm afraid all our .gov friends can see here is demands for another service to "be delivered". The one thing I'm hoping for, as Nic and Pia offer their recommendations to a Minister (later this year), is that we might <strong><a href="http://www.arcs.org.au/products-services/collaboration-services" rel="nofollow">see the tools</a></strong>, which Kevin and Co will get their hands on when they've got their AAF credentials, used to include a few more 'gem polishers' - some on the .gov.au domain, some in .edu.au, and some in edu's and gov's in foreign countries. Then let them have a play without supervision. (and hopefully they might protect us from another global crisis) The fun part, from what i can see, is going to happen when citizens realize that the .gov.au databases are largely unnecssary, because most of the useful stuff has already been compiled by librarians in the edu space. BTW. You can't <blockquote>integrate some of the processes that are already being developed into the way the Gov 2.0 Task Force itself operates</blockquote>Not until they are mature at least, and that's a while off. In the meantime well just have to share the learning. Thanks Pip, Kerry,

Can i get a bit philosophic, cause we need to be. You make the aim of what we are attempting here quite clearly.

the agenda of Gov 2.0, and of the whole project of providing transparency and openness in government

I’ve left off the last word data, as I think it throws everyone of the scent of a common sense. You might believe that “utilising the power of semantic technologies is going to play a big part in allowing us to address this issue”. I don’t. To me, what Nic and Pia have done is far more important. They’ve used various technologies to improve digital engagement. (and I’ve said all i have to say by doing some introductions.

It’s hard, I admit, as we move from an age when we expected our .gov (and .edu) institutions to just ‘deliver a service’ to one where we can compile many of services ourselves; if only we had the access rights into a few publicly funded databases. As I read Kevin’s latest entry, I can see it’s not just me that simply wants access, after which government can just get out of the way. I’ve tried to raise a discussion about AGOSP being a way in as well as a way out (of .gov databases) but I’m afraid all our .gov friends can see here is demands for another service to “be delivered”.

The one thing I’m hoping for, as Nic and Pia offer their recommendations to a Minister (later this year), is that we might see the tools, which Kevin and Co will get their hands on when they’ve got their AAF credentials, used to include a few more ‘gem polishers’ – some on the .gov.au domain, some in .edu.au, and some in edu’s and gov’s in foreign countries. Then let them have a play without supervision. (and hopefully they might protect us from another global crisis)

The fun part, from what i can see, is going to happen when citizens realize that the .gov.au databases are largely unnecssary, because most of the useful stuff has already been compiled by librarians in the edu space.

BTW. You can’t

integrate some of the processes that are already being developed into the way the Gov 2.0 Task Force itself operates

Not until they are mature at least, and that’s a while off. In the meantime well just have to share the learning.

]]>
By: Kerry Webb http://gov2.net.au/blog/2009/11/16/data-data-everywhere-but-not-a-scrap-of-sense/comment-page-1/#comment-4234 Kerry Webb Mon, 16 Nov 2009 02:25:44 +0000 http://gov2.net.au/?p=1345#comment-4234 Or perhaps if we just concentrate on giving the less-structured data just that bit of extra polish to make it a little more of a gem. The Taskforce did a good job putting together the data sets, but a little more quality control and the contribution of some information specialists would have improved the situation. Or perhaps if we just concentrate on giving the less-structured data just that bit of extra polish to make it a little more of a gem.

The Taskforce did a good job putting together the data sets, but a little more quality control and the contribution of some information specialists would have improved the situation.

]]>