[Buildingdata] Re: outliers
Colin Williams
crw104 at ecs.soton.ac.uk
Tue Apr 16 12:11:09 BST 2013
Chris,
I am using some code to do something similar on opendatamap, specifically for the food hygiene maps as they contain outliers. It's a bit of a hack, using some random sampling, but it's quick and may be useful to you. I can send more details later.
Christopher Gutteridge <cjg at ecs.soton.ac.uk> wrote:
>I was thinking of doing something like that for this map:
>http://maps.southampton.ac.uk/service/?src=http://id.southampton.ac.uk/dataset/places/latest
>
>which is rather screwed up on centering since we opened a building in
>Malaysia...
>
>On 16/04/2013 10:59, Andy Turner wrote:
>> This may be equivalent to your horrible hack, but one can truncate
>(as
>> in a truncated mean statistics) as follows: Calculate the centroid of
>
>> all the points, calculate the distance to all points from this
>> centroid, discount say the 5% of the furthest away points,
>recalculate
>> the bounding box (or circle or whatever you are using) for the
>> remaining points and use this. This could be computationally slow if
>> you are doing it lots of times, and setting the threshold is
>> arbitrary, but this could work. There are various ways to speed this
>> up, for instance you could simply use the centroid of each of the
>> buildings rather than each of their complete geometries. HTH Andy
>>
>>
>> On Tue, Apr 16, 2013 at 10:29 AM, Christopher Gutteridge
>> <cjg at ecs.soton.ac.uk <mailto:cjg at ecs.soton.ac.uk>> wrote:
>>
>> So I've got a situation. On our main campus, the database lists
>two
>> buildings as part of that site, although they are quite a way off
>the
>> main site and really screw up any automatic map renderings as
>they
>> cause
>> the important part to shrink when scaled-to-fit.
>>
>> The ideal solution is to have better data, but I am not allowed
>to
>> edit
>> what estates send me, although I can augment it.
>>
>> I currently remove them with an awful hack in the code, but what
>I'm
>> thinking is creating a relationship between a thing and seomthing
>it's
>> listed as having within, but should not be included on the
>default
>> map. eg.
>>
>> <http://id.southampton.ac.uk/site/1> hasOutlier
>> <http://id.southampton.ac.uk/building/1580> .
>>
>> Any thoughts?
>>
>> --
>> Christopher Gutteridge -- http://users.ecs.soton.ac.uk/cjg
>>
>> University of Southampton Open Data Service:
>> http://data.southampton.ac.uk/
>> You should read the ECS Web Team blog:
>> http://blogs.ecs.soton.ac.uk/webteam/
>>
>> _______________________________________________
>> Buildingdata mailing list
>> Buildingdata at ecs.soton.ac.uk
><mailto:Buildingdata at ecs.soton.ac.uk>
>> http://mailman.ecs.soton.ac.uk/mailman/listinfo/buildingdata
>>
>>
>>
>>
>> _______________________________________________
>> Buildingdata mailing list
>> Buildingdata at ecs.soton.ac.uk
>> http://mailman.ecs.soton.ac.uk/mailman/listinfo/buildingdata
>
>--
>Christopher Gutteridge -- http://users.ecs.soton.ac.uk/cjg
>
>University of Southampton Open Data Service:
>http://data.southampton.ac.uk/
>You should read the ECS Web Team blog:
>http://blogs.ecs.soton.ac.uk/webteam/
>
>
>
>------------------------------------------------------------------------
>
>_______________________________________________
>Buildingdata mailing list
>Buildingdata at ecs.soton.ac.uk
>http://mailman.ecs.soton.ac.uk/mailman/listinfo/buildingdata
Colin Williams
crw104 at ecs.soton.ac.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/buildingdata/attachments/20130416/711c7909/attachment.html
More information about the Buildingdata
mailing list