You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Jaya Naga Bhavana (JIRA)" <ji...@apache.org> on 2017/11/22 23:08:00 UTC

[jira] [Created] (SOLR-11666) Solr Field with different language values

Jaya Naga Bhavana created SOLR-11666:
----------------------------------------

             Summary: Solr Field with different language values
                 Key: SOLR-11666
                 URL: https://issues.apache.org/jira/browse/SOLR-11666
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
            Reporter: Jaya Naga Bhavana


*This is my python code:*

cmd = "curl localhost:8983/solr/" + core + "/update?commit=true -H 'Content-type:application/json' --data-binary " + "\"[{'id':'" + getLastAddedDocumentID(
            'id') + "','title':{'set':'" + title + "'},'author':{'set':'" + authorNames + "'},'abstract':{'set':'" + abstract + "'}}]\""
        print cmd
        pp = subprocess.Popen(cmd, shell=True, stdout=subprocess.PIPE)
        text, err = pp.communicate()
        print text

*My variable cmd which contains Curl command to add the data to fields is as below:*

curl localhost:8983/solr/test/update?commit=true -H 'Content-type:application/json' --data-binary "[{'id':'15973569-229c-4ce1-83e2-4f5ba543386f','title':{'set':'Bi\-criteria\ Algorithm\ for\ Scheduling\ Jobs\ on\ Cluster\ Platforms\ \*'},'author':{'set':'Pierre\-François\ Dutot\;\ Lionel\ Eyraud\;\ Grégory\ Gr´\;\ Grégory\ Mouní\;\ Denis\ Trystram\;\ '},'abstract':{'set':'We\ describe\ in\ this\ paper\ a\ new\ method\ for\ building\ an\ efficient\ algorithm\ for\ scheduling\ jobs\ in\ a\ cluster.\ Jobs\ are\ considered\ as\ parallel\ tasks\ \(PT\)\ which\ can\ be\ scheduled\ on\ any\ number\ of\ processors.\ The\ main\ feature\ is\ to\ consider\ two\ criteria\ that\ are\ optimized\ together.\ These\ criteria\ are\ the\ makespan\ and\ the\ weighted\ minimal\ average\ completion\ time\ \(minsum\).\ They\ are\ chosen\ for\ their\ complementarity,\ to\ be\ able\ to\ represent\ both\ user\-oriented\ objectives\ and\ system\ administrator\ objectives.\ We\ propose\ an\ algorithm\ based\ on\ a\ batch\ policy\ with\ increasing\ batch\ sizes,\ with\ a\ smart\ selection\ of\ jobs\ in\ each\ batch.\ This\ algorithm\ is\ assessed\ by\ intensive\ simulation\ results,\ compared\ to\ a\ new\ lower\ bound\ \(obtained\ by\ a\ relaxation\ of\ ILP\)\ of\ the\ optimal\ schedules\ for\ both\ criteria\ separately.\ It\ is\ currently\ implemented\ in\ an\ actual\ real\-size\ cluster\ platform.'}}]"

*The field abstract is as below:*

<field name="abstract" type="string" docValues="true" indexed="true" stored="true"/>

*The problem which I face when running this command is:*

Traceback (most recent call last):
  File "F:/pyCalculation/uploadResearchPaper.py", line 196, in <module>
    addDocument(pathToResearchPapersFolder + department + '/', query, department)
  File "F:/pyCalculation/uploadResearchPaper.py", line 188, in addDocument
    pp = subprocess.Popen(cmd, shell=True, stdout=subprocess.PIPE)
  File "C:\Python27\lib\subprocess.py", line 390, in __init__
    errread, errwrite)
  File "C:\Python27\lib\subprocess.py", line 610, in _execute_child
    args = '{} /c "{}"'.format (comspec, args)
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe7' in position 267: ordinal not in range(128)

*And the line at position 276 is:*

'set':'Pierre\-François\ Dutot\;

The problem is with ç this character.

I am confused like Why doesn't solr allow to add this data to the field?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org