You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@commons.apache.org by "Mark R. Diggory" <md...@latte.harvard.edu> on 2003/06/17 21:20:16 UTC

[math] StatUtils

I've roughed out the beginnings of StatUtils (based on implementations 
found in AbstractStoreUnivariate). I will add this if all seem to agree 
on its usage.

-Mark

Re: [math] StatUtils

Posted by Tim O'Brien <to...@discursive.com>.
+1

On Tue, 17 Jun 2003, Mark R. Diggory wrote:

> I've roughed out the beginnings of StatUtils (based on implementations 
> found in AbstractStoreUnivariate). I will add this if all seem to agree 
> on its usage.
> 
> -Mark
> 

-- 
----------------------
Tim O'Brien
Evanston, IL
(847) 863-7045
tobrien@discursive.com



---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


Re: [math] StatUtils

Posted by "Mark R. Diggory" <md...@latte.harvard.edu>.
Al Chou wrote:
> --- Brent Worden <br...@worden.org> wrote:
> 
>>Looks fine.
>>
>>Two points for discussion:
>>
>>1. Will we ever have the need to compute statistics from sub-arrays? i.e
>>getMean(double[] array, int beginIndex, int endIndex) much like
> 
> 
> I think that's a worthwhile feature.
> 
Not a bad idea, in the case where one would try to do this with the 
current double[] --> double methods it would require an array copy. They 
would be a usefull and efficient alternative.

-Mark



---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


RE: [math] StatUtils

Posted by Al Chou <ho...@yahoo.com>.
--- Brent Worden <br...@worden.org> wrote:
> Looks fine.
> 
> Two points for discussion:
> 
> 1. Will we ever have the need to compute statistics from sub-arrays? i.e
> getMean(double[] array, int beginIndex, int endIndex) much like

I think that's a worthwhile feature.


Al

=====
Albert Davidson Chou

    Get answers to Mac questions at http://www.Mac-Mgrs.org/ .

__________________________________
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


RE: [math] StatUtils

Posted by Brent Worden <br...@worden.org>.
Looks fine.

Two points for discussion:

1. Will we ever have the need to compute statistics from sub-arrays? i.e
getMean(double[] array, int beginIndex, int endIndex) much like
System.arraycopy.

2. Should we check for null arrays at this level?  Or should it be the
responsibility of the caller to check for null?

Brent Worden
http://www.brent.worden.org

> -----Original Message-----
> From: Mark R. Diggory [mailto:mdiggory@latte.harvard.edu]
> Sent: Tuesday, June 17, 2003 5:05 PM
> To: Jakarta Commons Developers List
> Subject: Re: [math] StatUtils
>
>
> Phil Steitz wrote:
>
> >--- "Mark R. Diggory" <md...@latte.harvard.edu> wrote:
> >
> >
> >>I've roughed out the beginnings of StatUtils (based on implementations
> >>found in AbstractStoreUnivariate). I will add this if all seem to agree
> >>on its usage.
> >>
> >>
> >>
> >+1 for this approach, rolling back previous changes and having
> >AbstractStoreUnivariate and UnivariateImpl in the finite window
> case delegate
> >to these methods.
> >
> >Obviously need full javadoc including NaN conditions, etc.  Also should
> >probably improve the sumsq and  variance computations using
> corrected two-pass.
> >
> >
> >
> +1, +1, just an intial rough draft, the commited version will have
> appropriate javadoc. I'm hoping maybe Al can dig into the two-pass.
> Otherwise that will come next on my list.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: commons-dev-help@jakarta.apache.org
>
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


Re: [math] StatUtils

Posted by "Mark R. Diggory" <md...@latte.harvard.edu>.
Phil Steitz wrote:

>--- "Mark R. Diggory" <md...@latte.harvard.edu> wrote:
>  
>
>>I've roughed out the beginnings of StatUtils (based on implementations 
>>found in AbstractStoreUnivariate). I will add this if all seem to agree 
>>on its usage.
>>
>>    
>>
>+1 for this approach, rolling back previous changes and having
>AbstractStoreUnivariate and UnivariateImpl in the finite window case delegate
>to these methods.
>
>Obviously need full javadoc including NaN conditions, etc.  Also should
>probably improve the sumsq and  variance computations using corrected two-pass.
>
> 
>
+1, +1, just an intial rough draft, the commited version will have 
appropriate javadoc. I'm hoping maybe Al can dig into the two-pass. 
Otherwise that will come next on my list.


---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


Re: [math] StatUtils

Posted by Phil Steitz <st...@yahoo.com>.
--- "Mark R. Diggory" <md...@latte.harvard.edu> wrote:
> I've roughed out the beginnings of StatUtils (based on implementations 
> found in AbstractStoreUnivariate). I will add this if all seem to agree 
> on its usage.
> 
+1 for this approach, rolling back previous changes and having
AbstractStoreUnivariate and UnivariateImpl in the finite window case delegate
to these methods.

Obviously need full javadoc including NaN conditions, etc.  Also should
probably improve the sumsq and  variance computations using corrected two-pass.

 
> -Mark
> > Index: StatUtils.java
> ===================================================================
> RCS file: StatUtils.java
> diff -N StatUtils.java
> --- /dev/null	1 Jan 1970 00:00:00 -0000
> +++ StatUtils.java	17 Jun 2003 19:17:26 -0000
> @@ -0,0 +1,135 @@
> +/* ====================================================================
> + * The Apache Software License, Version 1.1
> + *
> + * Copyright (c) 2003 The Apache Software Foundation.  All rights
> + * reserved.
> + *
> + * Redistribution and use in source and binary forms, with or without
> + * modification, are permitted provided that the following conditions
> + * are met:
> + *
> + * 1. Redistributions of source code must retain the above copyright
> + *    notice, this list of conditions and the following disclaimer.
> + *
> + * 2. Redistributions in binary form must reproduce the above copyright
> + *    notice, this list of conditions and the following disclaimer in
> + *    the documentation and/or other materials provided with the
> + *    distribution.
> + *
> + * 3. The end-user documentation included with the redistribution, if
> + *    any, must include the following acknowlegement:
> + *       "This product includes software developed by the
> + *        Apache Software Foundation (http://www.apache.org/)."
> + *    Alternately, this acknowlegement may appear in the software itself,
> + *    if and wherever such third-party acknowlegements normally appear.
> + *
> + * 4. The names "The Jakarta Project", "Commons", and "Apache Software
> + *    Foundation" must not be used to endorse or promote products derived
> + *    from this software without prior written permission. For written
> + *    permission, please contact apache@apache.org.
> + *
> + * 5. Products derived from this software may not be called "Apache"
> + *    nor may "Apache" appear in their names without prior written
> + *    permission of the Apache Software Foundation.
> + *
> + * THIS SOFTWARE IS PROVIDED ``AS IS'' AND ANY EXPRESSED OR IMPLIED
> + * WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES
> + * OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
> + * DISCLAIMED.  IN NO EVENT SHALL THE APACHE SOFTWARE FOUNDATION OR
> + * ITS CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
> + * SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
> + * LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF
> + * USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND
> + * ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
> + * OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT
> + * OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
> + * SUCH DAMAGE.
> + * ====================================================================
> + *
> + * This software consists of voluntary contributions made by many
> + * individuals on behalf of the Apache Software Foundation.  For more
> + * information on the Apache Software Foundation, please see
> + * <http://www.apache.org/>.
> + */
> +package org.apache.commons.math.stat;
> +
> +/**
> + * StatUtils provides fast static implementation of common double[] based
> + * statistical methods
> + *
> + * @author <a href="mailto:mdiggory@apache.org">Mark Diggory</a>
> + */
> +public class StatUtils {
> +
> +	public static int getN(double[] values) {
> +		return values.length;
> +	}
> +
> +	public static double getSum(double[] values) {
> +		double accum = 0.0;
> +		for (int i = 0; i < getN(values); i++) {
> +			accum += values[i];
> +		}
> +		return accum;
> +	}
> +
> +	public static double getSumSq(double[] values) {
> +		double accum = 0.0;
> +		for (int i = 0; i < getN(values); i++) {
> +			accum += Math.pow(values[i], 2.0);
> +		}
> +		return accum;
> +	}
> +
> +	public static double getMean(double[] values) {
> +		return getSum(values) / getN(values);
> +	}
> +
> +	public static double getVariance(double[] values) {
> +		double variance = Double.NaN;
> +        
> +		if (getN(values) == 1) {
> +			variance = 0;
> +		} else if (getN(values) > 1) {
> +			double mean = getMean(values);
> +			double accum = 0.0;
> +			for (int i = 0; i < getN(values); i++) {
> +				accum += Math.pow((values[i] - mean), 2.0);
> +			}
> +			variance = accum / (getN(values) - 1);
> +		}
> +		return variance;
> +	}
> +
> +	public static double getStandardDeviation(double[] values) {
> +		double stdDev = Double.NaN;
> +		if (getN(values) != 0) {
> +			stdDev = Math.sqrt(getVariance(values));
> +		}
> +		return (stdDev);
> +	}
> +
> +	public static double getMax(double[] values) {
> +		double max = Double.NaN;
> +		for (int i = 0; i < getN(values); i++) {
> +			if (i == 0) {
> +				max = values[i];
> +			} else {
> +				max = Math.max(max, values[i]);
> +			}
> +		}
> +		return max;
> +	}
> +
> +	public static double getMin(double[] values) {
> +		double min = Double.NaN;
> +		for (int i = 0; i < getN(values); i++) {
> +			if (i == 0) {
> +				min = values[i];
> +			} else {
> +				min = Math.min(min, values[i]);
> +			}
> +		}
> +		return min;
> +	}
> +}
> \ No newline at end of file
> 
> > ---------------------------------------------------------------------
> To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: commons-dev-help@jakarta.apache.org


__________________________________
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org