You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by David Stuart <da...@progressivealliance.co.uk> on 2010/01/09 16:07:55 UTC

Glossary

Hey Mahouters,

Having been subscribed to this list for a couple of months now and in trying to get my head around the some of the big brain discussions that go down here I have started a new Glossary page on the wiki. That will hopefully shed some light on all of the acronyms that get used the email and to help newbies like myself with further reading etc.
As I see new terms I will try to add them (assuming I get the right ones !) but it would be useful if you could add references to papers or good explanations

Heres the link: http://cwiki.apache.org/confluence/display/MAHOUT/Glossary

Cheers,

Dave

Re: Glossary

Posted by Bogdan Vatkov <bo...@gmail.com>.
Greart Dave!
I am a newbie as well and definitelly need something like a glossary.
I will try to support you by putting more terms on the page to be
explained by the guys who have the knowledge :)



On 1/9/10, David Stuart <da...@progressivealliance.co.uk> wrote:
> Hey Mahouters,
>
> Having been subscribed to this list for a couple of months now and in trying
> to get my head around the some of the big brain discussions that go down
> here I have started a new Glossary page on the wiki. That will hopefully
> shed some light on all of the acronyms that get used the email and to help
> newbies like myself with further reading etc.
> As I see new terms I will try to add them (assuming I get the right ones !)
> but it would be useful if you could add references to papers or good
> explanations
>
> Heres the link: http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
>
> Cheers,
>
> Dave

-- 
Sent from Google Mail for mobile | mobile.google.com

Best regards,
Bogdan

Re: Glossary

Posted by David Stuart <da...@progressivealliance.co.uk>.
Thanks Ted,

Yea as Grant put, I often find going to wikipedia can confuse me more and sometimes there are acronyms in the statistic scope that mean different things


On 9 Jan 2010, at 20:27, Grant Ingersoll wrote:

> 
> On Jan 9, 2010, at 3:21 PM, Ted Dunning wrote:
> 
>> Dave,
>> 
>> Great work.  As you may have seen, I have gone in and specialized the
>> definitions to make note of the Mahout and machine learning context that
>> they are used in here.
> 
> +1
> 
>> 
>> Andrew,
>> 
>> These terms often do have wikipedia definitions and these should definitely
>> be linked from the glossary entries, but the usage of these terms in this
>> project is often somewhat specialized.  That specialized should be called
>> out in our glossary  and contrasted with the more general use in the world
>> at large.
> 
> +1.  I often find it takes a bit to get from the Wikipedia explanation to the context of machine learning and/or Mahout, so good to have links to background reading (incl. Wikipedia) as well as definitions relevant to our approaches/discussions.
> 
>> 
>> On Sat, Jan 9, 2010 at 9:56 AM, Andrew Wang <an...@gmail.com>wrote:
>> 
>>> David,
>>> 
>>> When i said wiki, I mean wikipedia. The terms,  such as, "map reduce,
>>> TFIDF", in the Glossary you posted are already available in wikipedia.
>>> Maybe, for those terms, we just can put a URL ref in the Glossary. for
>>> other
>>> particular terms (the names of the algorithms implemented with mapreduce
>>> mode in Mahout ), we can explain them further.
>>> 
>>> Hope this tips are helpfully!
>>> 
>>> On Sun, Jan 10, 2010 at 1:32 AM, David Stuart <
>>> david.stuart@progressivealliance.co.uk> wrote:
>>> 
>>>> Hi Andrew,
>>>> 
>>>> Thanks for the help. Re the wiki I thought thats where I had created it
>>> is
>>>> there another place to put it?
>>>> 
>>>> Regards
>>>> 
>>>> On 9 Jan 2010, at 15:42, Andrew Wang wrote:
>>>> 
>>>>> Hi,David,
>>>>> 
>>>>> It sounds like a good idea to have somewhere to introduce the useful
>>>>> definiation mentioned in this maillist. However, the wiki maybe a
>>> better
>>>>> space to find this infomation. Anyway, i am new guy here, nice to find
>>>>> somebody else studing mahout at same time. Good luck!
>>>>> 
>>>>> 
>>>>> 
>>>>> On Sat, Jan 9, 2010 at 11:07 PM, David Stuart <
>>>>> david.stuart@progressivealliance.co.uk> wrote:
>>>>> 
>>>>>> Hey Mahouters,
>>>>>> 
>>>>>> Having been subscribed to this list for a couple of months now and in
>>>>>> trying to get my head around the some of the big brain discussions
>>> that
>>>> go
>>>>>> down here I have started a new Glossary page on the wiki. That will
>>>>>> hopefully shed some light on all of the acronyms that get used the
>>> email
>>>> and
>>>>>> to help newbies like myself with further reading etc.
>>>>>> As I see new terms I will try to add them (assuming I get the right
>>> ones
>>>> !)
>>>>>> but it would be useful if you could add references to papers or good
>>>>>> explanations
>>>>>> 
>>>>>> Heres the link:
>>>> http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
>>>>>> 
>>>>>> Cheers,
>>>>>> 
>>>>>> Dave
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> --
>>>>> http://anqiang1900.blog.163.com/
>>>> 
>>>> 
>>> 
>>> 
>>> --
>>> http://anqiang1900.blog.163.com/
>>> 
>> 
>> 
>> 
>> -- 
>> Ted Dunning, CTO
>> DeepDyve
> 
> --------------------------
> Grant Ingersoll
> http://www.lucidimagination.com/
> 
> Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search
> 


Re: Glossary

Posted by Grant Ingersoll <gs...@apache.org>.
On Jan 9, 2010, at 3:21 PM, Ted Dunning wrote:

> Dave,
> 
> Great work.  As you may have seen, I have gone in and specialized the
> definitions to make note of the Mahout and machine learning context that
> they are used in here.

+1

> 
> Andrew,
> 
> These terms often do have wikipedia definitions and these should definitely
> be linked from the glossary entries, but the usage of these terms in this
> project is often somewhat specialized.  That specialized should be called
> out in our glossary  and contrasted with the more general use in the world
> at large.

+1.  I often find it takes a bit to get from the Wikipedia explanation to the context of machine learning and/or Mahout, so good to have links to background reading (incl. Wikipedia) as well as definitions relevant to our approaches/discussions.

> 
> On Sat, Jan 9, 2010 at 9:56 AM, Andrew Wang <an...@gmail.com>wrote:
> 
>> David,
>> 
>> When i said wiki, I mean wikipedia. The terms,  such as, "map reduce,
>> TFIDF", in the Glossary you posted are already available in wikipedia.
>> Maybe, for those terms, we just can put a URL ref in the Glossary. for
>> other
>> particular terms (the names of the algorithms implemented with mapreduce
>> mode in Mahout ), we can explain them further.
>> 
>> Hope this tips are helpfully!
>> 
>> On Sun, Jan 10, 2010 at 1:32 AM, David Stuart <
>> david.stuart@progressivealliance.co.uk> wrote:
>> 
>>> Hi Andrew,
>>> 
>>> Thanks for the help. Re the wiki I thought thats where I had created it
>> is
>>> there another place to put it?
>>> 
>>> Regards
>>> 
>>> On 9 Jan 2010, at 15:42, Andrew Wang wrote:
>>> 
>>>> Hi,David,
>>>> 
>>>> It sounds like a good idea to have somewhere to introduce the useful
>>>> definiation mentioned in this maillist. However, the wiki maybe a
>> better
>>>> space to find this infomation. Anyway, i am new guy here, nice to find
>>>> somebody else studing mahout at same time. Good luck!
>>>> 
>>>> 
>>>> 
>>>> On Sat, Jan 9, 2010 at 11:07 PM, David Stuart <
>>>> david.stuart@progressivealliance.co.uk> wrote:
>>>> 
>>>>> Hey Mahouters,
>>>>> 
>>>>> Having been subscribed to this list for a couple of months now and in
>>>>> trying to get my head around the some of the big brain discussions
>> that
>>> go
>>>>> down here I have started a new Glossary page on the wiki. That will
>>>>> hopefully shed some light on all of the acronyms that get used the
>> email
>>> and
>>>>> to help newbies like myself with further reading etc.
>>>>> As I see new terms I will try to add them (assuming I get the right
>> ones
>>> !)
>>>>> but it would be useful if you could add references to papers or good
>>>>> explanations
>>>>> 
>>>>> Heres the link:
>>> http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
>>>>> 
>>>>> Cheers,
>>>>> 
>>>>> Dave
>>>> 
>>>> 
>>>> 
>>>> 
>>>> --
>>>> http://anqiang1900.blog.163.com/
>>> 
>>> 
>> 
>> 
>> --
>> http://anqiang1900.blog.163.com/
>> 
> 
> 
> 
> -- 
> Ted Dunning, CTO
> DeepDyve

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search


Re: Glossary

Posted by Ted Dunning <te...@gmail.com>.
Dave,

Great work.  As you may have seen, I have gone in and specialized the
definitions to make note of the Mahout and machine learning context that
they are used in here.

Andrew,

These terms often do have wikipedia definitions and these should definitely
be linked from the glossary entries, but the usage of these terms in this
project is often somewhat specialized.  That specialized should be called
out in our glossary  and contrasted with the more general use in the world
at large.

On Sat, Jan 9, 2010 at 9:56 AM, Andrew Wang <an...@gmail.com>wrote:

> David,
>
> When i said wiki, I mean wikipedia. The terms,  such as, "map reduce,
> TFIDF", in the Glossary you posted are already available in wikipedia.
> Maybe, for those terms, we just can put a URL ref in the Glossary. for
> other
> particular terms (the names of the algorithms implemented with mapreduce
> mode in Mahout ), we can explain them further.
>
> Hope this tips are helpfully!
>
> On Sun, Jan 10, 2010 at 1:32 AM, David Stuart <
> david.stuart@progressivealliance.co.uk> wrote:
>
> > Hi Andrew,
> >
> > Thanks for the help. Re the wiki I thought thats where I had created it
> is
> > there another place to put it?
> >
> > Regards
> >
> > On 9 Jan 2010, at 15:42, Andrew Wang wrote:
> >
> > > Hi,David,
> > >
> > > It sounds like a good idea to have somewhere to introduce the useful
> > > definiation mentioned in this maillist. However, the wiki maybe a
> better
> > > space to find this infomation. Anyway, i am new guy here, nice to find
> > > somebody else studing mahout at same time. Good luck!
> > >
> > >
> > >
> > > On Sat, Jan 9, 2010 at 11:07 PM, David Stuart <
> > > david.stuart@progressivealliance.co.uk> wrote:
> > >
> > >> Hey Mahouters,
> > >>
> > >> Having been subscribed to this list for a couple of months now and in
> > >> trying to get my head around the some of the big brain discussions
> that
> > go
> > >> down here I have started a new Glossary page on the wiki. That will
> > >> hopefully shed some light on all of the acronyms that get used the
> email
> > and
> > >> to help newbies like myself with further reading etc.
> > >> As I see new terms I will try to add them (assuming I get the right
> ones
> > !)
> > >> but it would be useful if you could add references to papers or good
> > >> explanations
> > >>
> > >> Heres the link:
> > http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
> > >>
> > >> Cheers,
> > >>
> > >> Dave
> > >
> > >
> > >
> > >
> > > --
> > > http://anqiang1900.blog.163.com/
> >
> >
>
>
> --
> http://anqiang1900.blog.163.com/
>



-- 
Ted Dunning, CTO
DeepDyve

Re: Glossary

Posted by Andrew Wang <an...@gmail.com>.
David,

When i said wiki, I mean wikipedia. The terms,  such as, "map reduce,
TFIDF", in the Glossary you posted are already available in wikipedia.
Maybe, for those terms, we just can put a URL ref in the Glossary. for other
particular terms (the names of the algorithms implemented with mapreduce
mode in Mahout ), we can explain them further.

Hope this tips are helpfully!

On Sun, Jan 10, 2010 at 1:32 AM, David Stuart <
david.stuart@progressivealliance.co.uk> wrote:

> Hi Andrew,
>
> Thanks for the help. Re the wiki I thought thats where I had created it is
> there another place to put it?
>
> Regards
>
> On 9 Jan 2010, at 15:42, Andrew Wang wrote:
>
> > Hi,David,
> >
> > It sounds like a good idea to have somewhere to introduce the useful
> > definiation mentioned in this maillist. However, the wiki maybe a better
> > space to find this infomation. Anyway, i am new guy here, nice to find
> > somebody else studing mahout at same time. Good luck!
> >
> >
> >
> > On Sat, Jan 9, 2010 at 11:07 PM, David Stuart <
> > david.stuart@progressivealliance.co.uk> wrote:
> >
> >> Hey Mahouters,
> >>
> >> Having been subscribed to this list for a couple of months now and in
> >> trying to get my head around the some of the big brain discussions that
> go
> >> down here I have started a new Glossary page on the wiki. That will
> >> hopefully shed some light on all of the acronyms that get used the email
> and
> >> to help newbies like myself with further reading etc.
> >> As I see new terms I will try to add them (assuming I get the right ones
> !)
> >> but it would be useful if you could add references to papers or good
> >> explanations
> >>
> >> Heres the link:
> http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
> >>
> >> Cheers,
> >>
> >> Dave
> >
> >
> >
> >
> > --
> > http://anqiang1900.blog.163.com/
>
>


-- 
http://anqiang1900.blog.163.com/

Re: Glossary

Posted by David Stuart <da...@progressivealliance.co.uk>.
Hi Andrew,

Thanks for the help. Re the wiki I thought thats where I had created it is there another place to put it?

Regards

On 9 Jan 2010, at 15:42, Andrew Wang wrote:

> Hi,David,
> 
> It sounds like a good idea to have somewhere to introduce the useful
> definiation mentioned in this maillist. However, the wiki maybe a better
> space to find this infomation. Anyway, i am new guy here, nice to find
> somebody else studing mahout at same time. Good luck!
> 
> 
> 
> On Sat, Jan 9, 2010 at 11:07 PM, David Stuart <
> david.stuart@progressivealliance.co.uk> wrote:
> 
>> Hey Mahouters,
>> 
>> Having been subscribed to this list for a couple of months now and in
>> trying to get my head around the some of the big brain discussions that go
>> down here I have started a new Glossary page on the wiki. That will
>> hopefully shed some light on all of the acronyms that get used the email and
>> to help newbies like myself with further reading etc.
>> As I see new terms I will try to add them (assuming I get the right ones !)
>> but it would be useful if you could add references to papers or good
>> explanations
>> 
>> Heres the link: http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
>> 
>> Cheers,
>> 
>> Dave
> 
> 
> 
> 
> -- 
> http://anqiang1900.blog.163.com/


Re: Glossary

Posted by Andrew Wang <an...@gmail.com>.
Hi,David,

It sounds like a good idea to have somewhere to introduce the useful
definiation mentioned in this maillist. However, the wiki maybe a better
space to find this infomation. Anyway, i am new guy here, nice to find
somebody else studing mahout at same time. Good luck!



On Sat, Jan 9, 2010 at 11:07 PM, David Stuart <
david.stuart@progressivealliance.co.uk> wrote:

> Hey Mahouters,
>
> Having been subscribed to this list for a couple of months now and in
> trying to get my head around the some of the big brain discussions that go
> down here I have started a new Glossary page on the wiki. That will
> hopefully shed some light on all of the acronyms that get used the email and
> to help newbies like myself with further reading etc.
> As I see new terms I will try to add them (assuming I get the right ones !)
> but it would be useful if you could add references to papers or good
> explanations
>
> Heres the link: http://cwiki.apache.org/confluence/display/MAHOUT/Glossary
>
> Cheers,
>
> Dave




-- 
http://anqiang1900.blog.163.com/

Re: Glossary

Posted by Isabel Drost <is...@apache.org>.
On 09.01.2010 David Stuart wrote:
> Heres the link: http://cwiki.apache.org/confluence/display/MAHOUT/Glossary

+1 Thanks for creating the page.

Isabel

Re: Glossary

Posted by Drew Farris <dr...@gmail.com>.
+1 it is great to get something like this started.

On Jan 9, 2010 10:08 AM, "David Stuart" <
david.stuart@progressivealliance.co.uk> wrote:

Hey Mahouters,

Having been subscribed to this list for a couple of months now and in trying
to get my head around the some of the big brain discussions that go down
here I have started a new Glossary page on the wiki. That will hopefully
shed some light on all of the acronyms that get used the email and to help
newbies like myself with further reading etc.
As I see new terms I will try to add them (assuming I get the right ones !)
but it would be useful if you could add references to papers or good
explanations

Heres the link: http://cwiki.apache.org/confluence/display/MAHOUT/Glossary

Cheers,

Dave