You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Sree Eedupuganti <sr...@inndata.in> on 2016/05/06 11:07:16 UTC

Recommendation Engine based on Content Filtering

Command : *./mahout recommenditembased
-Dmapred.input.dir=/user/temp/input.txt
-Dmapred.output.dir=/user/temp/output --usersFile /user/mahout/users.txt
--numRecommendations 2 --booleanData --similarityClassname
SIMILARITY_LOGLIKELIHOOD*

Error:
 16/05/06 11:00:34 INFO Job: Running job: job_1461844363112_0017
16/05/06 11:00:39 INFO Job: Job job_1461844363112_0017 running in uber mode
: false
16/05/06 11:00:39 INFO Job:  map 0% reduce 0%
16/05/06 11:00:42 INFO Job: Task Id :
attempt_1461844363112_0017_m_000000_0, Status : FAILED
Error: java.lang.NumberFormatException: For input string: "
leticia9jdqf@gmail.com"

Any suggestions please
-- 
Best Regards,
Sreeharsha Eedupuganti
Data Engineer
innData Analytics Private Limited

Re: Recommendation Engine based on Content Filtering

Posted by Sree Eedupuganti <sr...@inndata.in>.
This is a sample data trying to build a recommendation engine.

On Fri, May 6, 2016 at 4:58 PM, Sebastian <ss...@apache.org> wrote:

> Please don't post data from real users on this list without their consent.
>
> On 06.05.2016 13:26, Sree Eedupuganti wrote:
>
>>  From the below sample data, i have to recommend user based on email any
>> suggestions please.
>>
>> leticia9jdqf@gmail.com    861102    Associations    hrutnjal@aol.com
>> leticia9jdqf@gmail.com    sexy black full
>> lips    Others    570485c10768cb0006e168fa    32202158    1    2016-04-06
>> 03:16:46.000    2016-04-06 02:38:56.000
>> 7af136b6.15fb.153e96fe052.Coremail.leticia9jdqf@126.com    2016-04-06
>> 03:14:00.000    2016-04-06 03:14:00.000    2016-04-06
>> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
>> info@yk.com    2YK: IMPORTANT - Your
>> password    Others    5704685c623ccb0006885856    22246    1    2016-04-06
>> 01:03:53.000    2016-04-06 00:51:08.000
>> B0547344678@mail.infoquesthosting.net    2016-04-06
>> 00:55:19.000    2016-04-06 00:55:19.000    2016-04-06
>> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
>> info@yk.com    2YK: IMPORTANT - Your
>> password    Others    5704685c623ccb0006885857    22245    1    2016-04-06
>> 01:03:53.000    2016-04-06 00:52:13.000
>> B0547344691@mail.infoquesthosting.net    2016-04-06
>> 00:55:18.000    2016-04-06 00:55:18.000    2016-04-06
>> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
>> info@yk.com    Regarding activation of your account from2YK.com
>> <http://2yk.com/>    Others    57046f0ceff8c500079afc31    22248    1
>>    2016-04-06
>> 01:42:57.000    2016-04-06 01:18:04.000
>> B0547345071@mail.infoquesthosting.net    2016-04-06
>> 01:28:30.000    2016-04-06 01:28:30.000    2016-04-06
>> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
>> info@yk.com    Status of your 2YK.com <http://2yk.com/> Order#-238176
>> New
>>
>> Orders    57046929623ccb0006886191    22247    1    2016-04-06
>> 01:19:15.000    2016-04-06 01:06:17.000
>> B0547344903@mail.infoquesthosting.net    2016-04-06
>> 01:17:38.000    2016-04-06 01:17:38.000    2016-04-06
>> contact@daymillionaire.co    912103    Government Offices-County
>> brianhughes71423@gmail.com    amberoptions@binaryoptions.com.co    You
>> are
>> owed
>> $25,718.19    Others    5704dac380c7f600052909f9    37480    1
>> 2016-04-06
>> 09:45:33.000    2016-04-06 07:02:24.000
>> contact@daymillionaire.co    912103    Government Offices-County
>> kennehls@yahoo.com    amberoptions@binaryoptions.com.co    You are owed
>> $25,718.19    Others    5704cfb7e8013500092f6bca    535810
>> 1296685215    2016-04-06
>> 07:32:00.000    2016-04-06 07:26:20.000
>> contact@daymillionaire.co    912103    Government Offices-County
>> w.stacy65@yahoo.com    amberoptions@binaryoptions.com.co    You are owed
>> $25,718.19    Others    5704d047e8013500092f7333    800805
>> 1321086091    2016-04-06
>> 08:15:25.000    2016-04-06 07:56:25.000
>> info@dimes.eu    784102    Video Tapes & Discs-Renting & Leasing
>> zactopayne@yahoo.com    info@dimes.eu    IMPORTANT!. 5Dimes Account
>> Information
>> Recovery    Others    57045dcd2a69890005c879a2    186992    1367767401
>>     2016-04-06
>> 00:39:57.000    2016-04-06
>> 00:33:49.000    B0074208073@newmailserver.5dom.dom    2016-04-06
>> 00:34:57.000    2016-04-06 00:34:57.000    2016-04-06
>>
>> On Fri, May 6, 2016 at 4:37 PM, Sree Eedupuganti <sr...@inndata.in> wrote:
>>
>> Command : *./mahout recommenditembased
>>> -Dmapred.input.dir=/user/temp/input.txt
>>> -Dmapred.output.dir=/user/temp/output --usersFile /user/mahout/users.txt
>>> --numRecommendations 2 --booleanData --similarityClassname
>>> SIMILARITY_LOGLIKELIHOOD*
>>>
>>> Error:
>>>   16/05/06 11:00:34 INFO Job: Running job: job_1461844363112_0017
>>> 16/05/06 11:00:39 INFO Job: Job job_1461844363112_0017 running in uber
>>> mode : false
>>> 16/05/06 11:00:39 INFO Job:  map 0% reduce 0%
>>> 16/05/06 11:00:42 INFO Job: Task Id :
>>> attempt_1461844363112_0017_m_000000_0, Status : FAILED
>>> Error: java.lang.NumberFormatException: For input string: "
>>> leticia9jdqf@gmail.com"
>>>
>>> Any suggestions please
>>> --
>>> Best Regards,
>>> Sreeharsha Eedupuganti
>>> Data Engineer
>>> innData Analytics Private Limited
>>>
>>>
>>
>>
>>


-- 
Best Regards,
Sreeharsha Eedupuganti
Data Engineer
innData Analytics Private Limited

Re: Recommendation Engine based on Content Filtering

Posted by Sebastian <ss...@apache.org>.
Please don't post data from real users on this list without their consent.

On 06.05.2016 13:26, Sree Eedupuganti wrote:
>  From the below sample data, i have to recommend user based on email any
> suggestions please.
>
> leticia9jdqf@gmail.com    861102    Associations    hrutnjal@aol.com
> leticia9jdqf@gmail.com    sexy black full
> lips    Others    570485c10768cb0006e168fa    32202158    1    2016-04-06
> 03:16:46.000    2016-04-06 02:38:56.000
> 7af136b6.15fb.153e96fe052.Coremail.leticia9jdqf@126.com    2016-04-06
> 03:14:00.000    2016-04-06 03:14:00.000    2016-04-06
> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
> info@yk.com    2YK: IMPORTANT - Your
> password    Others    5704685c623ccb0006885856    22246    1    2016-04-06
> 01:03:53.000    2016-04-06 00:51:08.000
> B0547344678@mail.infoquesthosting.net    2016-04-06
> 00:55:19.000    2016-04-06 00:55:19.000    2016-04-06
> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
> info@yk.com    2YK: IMPORTANT - Your
> password    Others    5704685c623ccb0006885857    22245    1    2016-04-06
> 01:03:53.000    2016-04-06 00:52:13.000
> B0547344691@mail.infoquesthosting.net    2016-04-06
> 00:55:18.000    2016-04-06 00:55:18.000    2016-04-06
> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
> info@yk.com    Regarding activation of your account from2YK.com
> <http://2yk.com/>    Others    57046f0ceff8c500079afc31    22248    1
>    2016-04-06
> 01:42:57.000    2016-04-06 01:18:04.000
> B0547345071@mail.infoquesthosting.net    2016-04-06
> 01:28:30.000    2016-04-06 01:28:30.000    2016-04-06
> info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
> info@yk.com    Status of your 2YK.com <http://2yk.com/> Order#-238176    New
> Orders    57046929623ccb0006886191    22247    1    2016-04-06
> 01:19:15.000    2016-04-06 01:06:17.000
> B0547344903@mail.infoquesthosting.net    2016-04-06
> 01:17:38.000    2016-04-06 01:17:38.000    2016-04-06
> contact@daymillionaire.co    912103    Government Offices-County
> brianhughes71423@gmail.com    amberoptions@binaryoptions.com.co    You are
> owed
> $25,718.19    Others    5704dac380c7f600052909f9    37480    1    2016-04-06
> 09:45:33.000    2016-04-06 07:02:24.000
> contact@daymillionaire.co    912103    Government Offices-County
> kennehls@yahoo.com    amberoptions@binaryoptions.com.co    You are owed
> $25,718.19    Others    5704cfb7e8013500092f6bca    535810
> 1296685215    2016-04-06
> 07:32:00.000    2016-04-06 07:26:20.000
> contact@daymillionaire.co    912103    Government Offices-County
> w.stacy65@yahoo.com    amberoptions@binaryoptions.com.co    You are owed
> $25,718.19    Others    5704d047e8013500092f7333    800805
> 1321086091    2016-04-06
> 08:15:25.000    2016-04-06 07:56:25.000
> info@dimes.eu    784102    Video Tapes & Discs-Renting & Leasing
> zactopayne@yahoo.com    info@dimes.eu    IMPORTANT!. 5Dimes Account
> Information
> Recovery    Others    57045dcd2a69890005c879a2    186992    1367767401
>     2016-04-06
> 00:39:57.000    2016-04-06
> 00:33:49.000    B0074208073@newmailserver.5dom.dom    2016-04-06
> 00:34:57.000    2016-04-06 00:34:57.000    2016-04-06
>
> On Fri, May 6, 2016 at 4:37 PM, Sree Eedupuganti <sr...@inndata.in> wrote:
>
>> Command : *./mahout recommenditembased
>> -Dmapred.input.dir=/user/temp/input.txt
>> -Dmapred.output.dir=/user/temp/output --usersFile /user/mahout/users.txt
>> --numRecommendations 2 --booleanData --similarityClassname
>> SIMILARITY_LOGLIKELIHOOD*
>>
>> Error:
>>   16/05/06 11:00:34 INFO Job: Running job: job_1461844363112_0017
>> 16/05/06 11:00:39 INFO Job: Job job_1461844363112_0017 running in uber
>> mode : false
>> 16/05/06 11:00:39 INFO Job:  map 0% reduce 0%
>> 16/05/06 11:00:42 INFO Job: Task Id :
>> attempt_1461844363112_0017_m_000000_0, Status : FAILED
>> Error: java.lang.NumberFormatException: For input string: "
>> leticia9jdqf@gmail.com"
>>
>> Any suggestions please
>> --
>> Best Regards,
>> Sreeharsha Eedupuganti
>> Data Engineer
>> innData Analytics Private Limited
>>
>
>
>

Re: Recommendation Engine based on Content Filtering

Posted by Sree Eedupuganti <sr...@inndata.in>.
From the below sample data, i have to recommend user based on email any
suggestions please.

leticia9jdqf@gmail.com    861102    Associations    hrutnjal@aol.com
leticia9jdqf@gmail.com    sexy black full
lips    Others    570485c10768cb0006e168fa    32202158    1    2016-04-06
03:16:46.000    2016-04-06 02:38:56.000
7af136b6.15fb.153e96fe052.Coremail.leticia9jdqf@126.com    2016-04-06
03:14:00.000    2016-04-06 03:14:00.000    2016-04-06
info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
info@yk.com    2YK: IMPORTANT - Your
password    Others    5704685c623ccb0006885856    22246    1    2016-04-06
01:03:53.000    2016-04-06 00:51:08.000
B0547344678@mail.infoquesthosting.net    2016-04-06
00:55:19.000    2016-04-06 00:55:19.000    2016-04-06
info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
info@yk.com    2YK: IMPORTANT - Your
password    Others    5704685c623ccb0006885857    22245    1    2016-04-06
01:03:53.000    2016-04-06 00:52:13.000
B0547344691@mail.infoquesthosting.net    2016-04-06
00:55:18.000    2016-04-06 00:55:18.000    2016-04-06
info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
info@yk.com    Regarding activation of your account from2YK.com
<http://2yk.com/>    Others    57046f0ceff8c500079afc31    22248    1
  2016-04-06
01:42:57.000    2016-04-06 01:18:04.000
B0547345071@mail.infoquesthosting.net    2016-04-06
01:28:30.000    2016-04-06 01:28:30.000    2016-04-06
info@yk.com    701101    Hotels & Motels    saumilpatel3007@gmail.com
info@yk.com    Status of your 2YK.com <http://2yk.com/> Order#-238176    New
Orders    57046929623ccb0006886191    22247    1    2016-04-06
01:19:15.000    2016-04-06 01:06:17.000
B0547344903@mail.infoquesthosting.net    2016-04-06
01:17:38.000    2016-04-06 01:17:38.000    2016-04-06
contact@daymillionaire.co    912103    Government Offices-County
brianhughes71423@gmail.com    amberoptions@binaryoptions.com.co    You are
owed
$25,718.19    Others    5704dac380c7f600052909f9    37480    1    2016-04-06
09:45:33.000    2016-04-06 07:02:24.000
contact@daymillionaire.co    912103    Government Offices-County
kennehls@yahoo.com    amberoptions@binaryoptions.com.co    You are owed
$25,718.19    Others    5704cfb7e8013500092f6bca    535810
1296685215    2016-04-06
07:32:00.000    2016-04-06 07:26:20.000
contact@daymillionaire.co    912103    Government Offices-County
w.stacy65@yahoo.com    amberoptions@binaryoptions.com.co    You are owed
$25,718.19    Others    5704d047e8013500092f7333    800805
1321086091    2016-04-06
08:15:25.000    2016-04-06 07:56:25.000
info@dimes.eu    784102    Video Tapes & Discs-Renting & Leasing
zactopayne@yahoo.com    info@dimes.eu    IMPORTANT!. 5Dimes Account
Information
Recovery    Others    57045dcd2a69890005c879a2    186992    1367767401
   2016-04-06
00:39:57.000    2016-04-06
00:33:49.000    B0074208073@newmailserver.5dom.dom    2016-04-06
00:34:57.000    2016-04-06 00:34:57.000    2016-04-06

On Fri, May 6, 2016 at 4:37 PM, Sree Eedupuganti <sr...@inndata.in> wrote:

> Command : *./mahout recommenditembased
> -Dmapred.input.dir=/user/temp/input.txt
> -Dmapred.output.dir=/user/temp/output --usersFile /user/mahout/users.txt
> --numRecommendations 2 --booleanData --similarityClassname
> SIMILARITY_LOGLIKELIHOOD*
>
> Error:
>  16/05/06 11:00:34 INFO Job: Running job: job_1461844363112_0017
> 16/05/06 11:00:39 INFO Job: Job job_1461844363112_0017 running in uber
> mode : false
> 16/05/06 11:00:39 INFO Job:  map 0% reduce 0%
> 16/05/06 11:00:42 INFO Job: Task Id :
> attempt_1461844363112_0017_m_000000_0, Status : FAILED
> Error: java.lang.NumberFormatException: For input string: "
> leticia9jdqf@gmail.com"
>
> Any suggestions please
> --
> Best Regards,
> Sreeharsha Eedupuganti
> Data Engineer
> innData Analytics Private Limited
>



-- 
Best Regards,
Sreeharsha Eedupuganti
Data Engineer
innData Analytics Private Limited