Posted to issues@openoffice.apache.org by bu...@apache.org on 2013/12/29 05:05:20 UTC

[Bug 123928] New: Hyperlinks not output in CSV output format

https://issues.apache.org/ooo/show_bug.cgi?id=123928

            Bug ID: 123928
        Issue Type: DEFECT
           Summary: Hyperlinks not output in CSV output format
           Product: Calc
           Version: 4.0.1
          Hardware: PC
                OS: Windows 8
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: save-export
          Assignee: issues@openoffice.apache.org
          Reporter: alan.schnell@hawaiiantel.net
                CC: issues@openoffice.apache.org

I have a spreadsheet that contains hyperlinks. I want to output the data in the
spreadsheet (including the hyperlinks) in CSV format. Calc will output the
hyperlinks when saving as an HTML file, XML file or as an ODS file but not when
saving as a CSV file. Since it apparently saves the hyperlinks in every other
output format and fails to save them only in CSV format, this seems to be a
bug.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
You are watching all bug changes.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

--- Comment #6 from Alan <sc...@juno.com> ---
Created attachment 82192
  --> https://issues.apache.org/ooo/attachment.cgi?id=82192&action=edit
Calc Spreadsheet with two rows of data in two columns

I have attached a zip file with six items. The readme.txt file contains
basically what is in this comment. bugzillaError.jpg shows the error I get when
attempting to attach an .ods file using the "Browse..." function of the
Bugzilla web page. The other five files demonstrate the issue.

The spreadsheet testSheet.ods has one row and two columns. The first column
contains text (visible) and a hyperlink (hidden). The second column contains
text.

If I output as XML (using Save As) the hyperlink value is output to the XML
file (testSheet.xml) as shown by this content in the XML file:
    <Cell ss:HRef="http://www.openoffice.org/"><Data ss:Type="String">open office</Data></Cell>

If I output as HTML (using Save As) the hyperlink value is output to the HTML
file (testSheet.html) as shown by this content in the HTML file:
    <TD WIDTH=84 HEIGHT=18 ALIGN=LEFT><A HREF="http://www.openoffice.org/">open office</A></TD>

If I output as CSV (using Save As) the hyperlink value is NOT output to the CSV
file (testSheet.csv) as shown by this content in the CSV file:
    open office,some text

Hidden values are being output to XML and HTML as part of the cell contents. So
saying that hidden values are not output is incorrect. They are output. My
issue is that they are not output when the output format is CSV.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

Alan <sc...@juno.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |UNCONFIRMED
         Resolution|NOT_AN_ISSUE                |---

--- Comment #2 from Alan <sc...@juno.com> ---
(In reply to Rainer Bielefeld from comment #1)
> No, it's a characteristic [1][2] of CSV, which is something like plain text [3]
> 
> [1] <http://en.wikipedia.org/wiki/Comma-separated_values>
> [2] <http://tools.ietf.org/html/rfc4180>
> [3] <http://en.wikipedia.org/wiki/Plain_text>

I am very conversant with data formats. So while the references you provided
were unnecessary, it is nice to know that you were willing to go to the effort
to help someone out.

I was obviously not clear enough in my description of the issue (my fault: I
assumed it would be understood that I knew what CSV was).

I was not expecting the hyperlink to be output as a hyperlink in the csv file.
I was expecting the VALUE of the hyperlink to be output as text. The value of
the hyperlink is part of the data within the spreadsheet. When outputting the
data from the spreadsheet, I would expect that the hyperlink value would also
be output.

If the value of the hyperlink is not output, then I have to maintain DUPLICATE
fields in the spreadsheet: one with the hyperlink and a second with the text
value of the hyperlink. While I could use the URL itself as the hyperlink's
display text to avoid duplication, this would make the cell content nearly
worthless, since many URLs provide no clue as to the actual content of the
page.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

--- Comment #3 from Rainer Bielefeld <ra...@bielefeldundbuss.de> ---
@Alan:
Please simply attach a test kit showing your problem.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

Edwin Sharp <el...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |CONFIRMED
             Latest|---                         |4.1-dev
    Confirmation on|                            |
         Issue Type|DEFECT                      |ENHANCEMENT
     Ever confirmed|0                           |1
                 OS|Windows 8                   |All
           Severity|normal                      |trivial

--- Comment #9 from Edwin Sharp <el...@apache.org> ---
For sure not a defect.
I couldn't output both text and link to csv in Excel 2010, Calligra Sheets
2.6.4 and Gnumeric 1.10.16.

AOO410m1(Build:9750)  -  Rev. 1552994
Rev.1552994
Win 7

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

Alan <sc...@juno.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |UNCONFIRMED
         Resolution|NOT_AN_ISSUE                |---

--- Comment #8 from Alan <sc...@juno.com> ---
Ref [1] does not specify that hypertext link values are not output. 

Ref [2] RFC4180 does NOT specify what fields are or are not output in CSV. It
specifies the formatting of the lines and that the content is plain text. 

Ref [3] states: "Files that contain markup or other meta-data are generally
considered plain-text"


So meta-data is considered (by the references that you provided) to be plain
text. The hyperlink value is meta-data for the cell. You output the meta-data
to other formats (XML & HTML) -- you even output cell comments!!

The only explicitly stated exclusion of output from a spreadsheet (in any of
the provided references) is FORMULAs. The hypertext link value is not a
formula, it is TEXT. By refs [1], [2] and [3], the hyperlink value should be
output. 

As to stating that users do not expect the hyperlink value to be output, that
is opinion, not fact. There has been at least one other written complaint about
Calc not outputting the hyperlink value in CSV format. I think you will find
that most people who need the hyperlink value work around the bug by using
either HTML or XML output.
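
For anyone taking the HTML route, here is a minimal Python sketch of that workaround. It assumes the exported file uses markup like the <TD>...<A HREF=...> fragment quoted in comment #6. The class name, the helper name, and the sample markup (including its second cell) are hypothetical and are not taken from an actual Calc export.

```python
from html.parser import HTMLParser


class LinkCellParser(HTMLParser):
    """Collect table rows; each cell becomes a (text, href-or-None) pair."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_cell = False
        self._text = []
        self._href = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "tr":
            self.rows.append([])
        elif tag == "td":
            self._in_cell = True
            self._text = []
            self._href = None
        elif tag == "a" and self._in_cell:
            self._href = dict(attrs).get("href")

    def handle_endtag(self, tag):
        if tag == "td" and self._in_cell:
            # Collapse line breaks and runs of spaces inside the cell text.
            text = " ".join("".join(self._text).split())
            if not self.rows:
                self.rows.append([])
            self.rows[-1].append((text, self._href))
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self._text.append(data)


def cells_with_links(html_text):
    """Return rows as flat lists, with each URL placed after its link text."""
    parser = LinkCellParser()
    parser.feed(html_text)
    parser.close()
    rows = []
    for row in parser.rows:
        flat = []
        for text, href in row:
            flat.append(text)
            if href:
                flat.append(href)
        rows.append(flat)
    return rows


# Hypothetical sample modelled on the fragment quoted in comment #6.
SAMPLE_HTML = """<TABLE><TR>
<TD WIDTH=84 HEIGHT=18 ALIGN=LEFT><A HREF="http://www.openoffice.org/">open
office</A></TD>
<TD ALIGN=LEFT>some text</TD>
</TR></TABLE>"""

print(cells_with_links(SAMPLE_HTML))
```

The resulting rows can be passed to csv.writer to produce the CSV file. Nested tables and cells that span rows or columns are not handled by this sketch.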

Sorry, but failure to output a TEXT value that is meta-data for a spreadsheet
cell is a bug. The hyperlink is text, it is a value of the cell, it is plain
text, it is meta-data, and it is inconsistent that Calc outputs meta-data for
HTML and XML but not CSV.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

Rainer Bielefeld <ra...@bielefeldundbuss.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|needmoreinfo                |
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|---                         |NOT_AN_ISSUE

--- Comment #7 from Rainer Bielefeld <ra...@bielefeldundbuss.de> ---
@Alan
Your expectation is wrong; it does not match [1], [2], [3], or other users'
expectations.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

Rainer Bielefeld <ra...@bielefeldundbuss.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
                 CC|                            |rainerbielefeld_ooo_qa@biel
                   |                            |efeldundbuss.de
         Resolution|---                         |NOT_AN_ISSUE

--- Comment #1 from Rainer Bielefeld <ra...@bielefeldundbuss.de> ---
No, it's a characteristic [1][2] of CSV, which is something like plain text [3]

[1] <http://en.wikipedia.org/wiki/Comma-separated_values>
[2] <http://tools.ietf.org/html/rfc4180>
[3] <http://en.wikipedia.org/wiki/Plain_text>

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

Edwin Sharp <el...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |needmoreinfo
                 CC|                            |elish@apache.org

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

--- Comment #5 from Rainer Bielefeld <ra...@bielefeldundbuss.de> ---
I still haven't a clue what the reporter's problem might be. I think he simply
has wrong expectations.

> I could not attach the spreadsheet because the Bugzilla "Browse..." function
> won't let me. 

@Alan:
So you are doing something wrong. But you can send me sourcedocument.odt and
result.csv by email (simply click my name in a comment), and I will attach the
documents for you.

And your expectation seems wrong. As I told you in Comment 1, 'Save as .csv'
will export only the plain text contents, not hyperlinks, tooltips, colors, or
other "hidden" contents.

"I expect" is nothing that will cause any activity here. My general hint is
that you should read our bug writing guidelines
<http://qa.openoffice.org/issue_handling/pre_submission.html> and
<http://wiki.openoffice.org/wiki/QA/HowToFileIssue#Principles>.
Unfortunately, important information we will need to reproduce your problem is
missing.
Please add all the information requested below:
a) Write a meaningful Summary describing exactly what the problem is
b) Attach a sample document (not only a screenshot) or refer to an existing
  sample document in another bug with a link; to attach a file to this
  bug report, just click on "Add an attachment" right on this page.
  If you want to attach a test kit with multiple documents, zip them into
  a single testkit.zip and attach the testkit.zip
c) Contribute document-related step-by-step instructions containing every
  key press and every mouse click needed to reproduce your problem
d) Attach screenshots with comments if you believe that might explain the
  problem better than a text comment. The best way is to insert your screenshots
  into a DRAW document and to add comments that explain what you want to show
e) If possible, contribute instructions on how to create a sample document
  from scratch
f) add information 
  f1) what EXACTLY is unexpected
  f2) and WHY do you believe it's unexpected (cite Help or Documentation!)
  f6) AOO settings that might be related to your problems 
     (video hardware acceleration, ...)
  f9) Whether that worked in former OOO / AOO Versions 
  f10) if a competitor's software (MSO, LibreOffice, ...) fulfills your needs

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

--- Comment #10 from Alan <sc...@juno.com> ---
So, since other spreadsheets handle exporting of meta-data to CSV incorrectly,
OO Calc should too?

Is this like Windows placing the Shut-Down function on the Start menu? Once
someone does it wrong everyone should think it is the right way to do it in the
name of maintaining "compatibility"?

Consider this topic closed. I will do what everyone else does and work around
the ?feature?/?defect?/?bug?. I will output as XML, extract the fields,
reformat it to CSV, and then input the CSV to my other processes.
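
A minimal Python sketch of that XML workaround, offered only as an illustration. It assumes the Save As XML output is the Excel 2003 SpreadsheetML dialect suggested by the ss:HRef attribute quoted in comment #6, using the urn:schemas-microsoft-com:office:spreadsheet namespace. The function names and the sample document are hypothetical, and cells skipped via ss:Index are not handled.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Assumed namespace of the exported file (Excel 2003 SpreadsheetML).
SS = "urn:schemas-microsoft-com:office:spreadsheet"


def rows_with_links(xml_text):
    """Yield one list per <Row>, adding each cell's ss:HRef after its text."""
    root = ET.fromstring(xml_text)
    for row in root.iter(f"{{{SS}}}Row"):
        out = []
        for cell in row.iter(f"{{{SS}}}Cell"):
            data = cell.find(f"{{{SS}}}Data")
            out.append("".join(data.itertext()) if data is not None else "")
            href = cell.get(f"{{{SS}}}HRef")
            if href is not None:
                out.append(href)
        yield out


def to_csv(xml_text):
    """Return the rows as CSV text; csv.writer quotes fields when needed."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerows(rows_with_links(xml_text))
    return buf.getvalue()


# Hypothetical sample modelled on the <Cell> fragment quoted in comment #6.
SAMPLE = """<?xml version="1.0"?>
<Workbook xmlns="urn:schemas-microsoft-com:office:spreadsheet"
          xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet">
 <Worksheet ss:Name="Sheet1"><Table>
  <Row>
   <Cell ss:HRef="http://www.openoffice.org/"><Data ss:Type="String">open office</Data></Cell>
   <Cell><Data ss:Type="String">some text</Data></Cell>
  </Row>
 </Table></Worksheet>
</Workbook>"""

print(to_csv(SAMPLE), end="")
```

With this sample the result follows the first layout proposed in comment #4: link text, then URL, then the remaining columns.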

Thank you for taking the time to respond!! It is appreciated, even though I am
not happy with the outcome.

[Bug 123928] Hyperlinks not output in CSV output format

Posted by bu...@apache.org.
https://issues.apache.org/ooo/show_bug.cgi?id=123928

--- Comment #4 from Alan <sc...@juno.com> ---
Created attachment 82188
  --> https://issues.apache.org/ooo/attachment.cgi?id=82188&action=edit
Image of the Calc spreadsheet content

I could not attach the spreadsheet because the Bugzilla "Browse..." function
won't let me. However, the attached JPEG image of the spreadsheet content will
hopefully be sufficient to make this clear. The image is of a spreadsheet with
two rows. The window over the spreadsheet is the hyperlink menu which shows the
value of the hyperlink in the first row, first cell.

I want to output the two rows (of two columns) to CSV format (comma separated
plain text). The first column has a hyperlink and the second column is just
text. The first column has TWO values for each cell. One is the value displayed
in the column and the second (not displayed) is the hyperlink content.

When using "Save As" and outputting to (apparently) anything but CSV, the
hyperlink value is output. For CSV it is not output. The hyperlink value is
part of the content of the cell and should be output (in my opinion...) since
it is data in the spreadsheet and without it the cell content is useless. 

The Calc output of the spreadsheet when using Save As to CSV is:
   open office,some text
   firefox,some other text
and this is useless. The most important content (the hyperlink value) has not
been output.

From my perspective, the output to CSV should be:
   open office,http://www.openoffice.org/,some text
   firefox,http://www.mozilla.org/en-US/firefox/new/,some other text

or 
   http://www.openoffice.org/,open office,some text
   http://www.mozilla.org/en-US/firefox/new/,firefox,some other text
