You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Brian Schultheiss (Jira)" <ji...@apache.org> on 2020/05/21 16:44:00 UTC
[jira] [Created] (ARROW-8880) Make R Binary Install Friendlier
Brian Schultheiss created ARROW-8880:
----------------------------------------
Summary: Make R Binary Install Friendlier
Key: ARROW-8880
URL: https://issues.apache.org/jira/browse/ARROW-8880
Project: Apache Arrow
Issue Type: Improvement
Components: R
Affects Versions: 0.17.1
Environment: Linux (Ubuntu)
Reporter: Brian Schultheiss
When R install tries to run a binary install, it looks for an exact match on the binary version, say "0.17.1.zip" from [https://dl.bintray.com/ursalabs/arrow-r/libarrow/bin/ubuntu-18.04/].
The problem is that even though "0.17.1" is pushed to CRAN as an official release, there is a time period (like right now) where bintray does not have an official binary build, just a date stamped build:
arrow-0.17.0.20200516.zip
arrow-0.17.0.20200517.zip
arrow-0.17.0.20200518.zip
arrow-0.17.0.zip
arrow-0.17.1.20200517.zip
arrow-0.17.1.20200519.zip
arrow-0.17.1.20200520.zip
I'd like to suggest adding a new environment variable trigger that would allow for the scanning of bintray for a recent timestamped version, if the specific release number is not present.
I'd like to suggest enhancing the linux code:
[https://github.com/apache/arrow/blob/02f7be33d1c32d1636323e6fb90c63cb01bf44af/r/tools/linuxlibs.R#L39-L47]
with scanning functionality:
{color:#c1c7d0}try_download <- function(from_url, to_file, scan_dates = FALSE) {{color}
{color:#c1c7d0} try({color}
{color:#c1c7d0} suppressWarnings({color}
{color:#c1c7d0} download.file(from_url, to_file, quiet = quietly){color}
{color:#c1c7d0} ),{color}
{color:#c1c7d0} silent = quietly{color}
{color:#c1c7d0} ){color}
{color:#c1c7d0} if (!file.exists(to_file)) {{color}
{color:#c1c7d0} {color:#0747a6} if (scan_dates) {{color}{color}
{color:#0747a6} scan_dates <- format(Sys.Date()-(0:10),"%Y%m%d"){color}
{color:#0747a6} for (scan_date in scan_dates) {{color}
{color:#0747a6} base_url <- tools::file_path_sans_ext(from_url){color}
{color:#0747a6} ext <- tools::file_ext(from_url){color}
{color:#0747a6} scan_url <- sprintf("%s.%s.%s", base_url, scan_date, ext){color}
{color:#0747a6} if (try_download(from_url = scan_url, to_file, scan_dates = FALSE)) {{color}
{color:#0747a6} return(TRUE){color}
{color:#0747a6} }{color}
{color:#0747a6} }{color}
{color:#0747a6} return(FALSE){color}
{color:#0747a6} } else {{color}
{color:#0747a6} return(FALSE){color}
{color:#0747a6} }{color}
{color:#c1c7d0} } else {{color}
{color:#c1c7d0} return(TRUE){color}
{color:#c1c7d0} }{color}
{color:#c1c7d0}}{color}
And then augment the calling function:
[https://github.com/apache/arrow/blob/02f7be33d1c32d1636323e6fb90c63cb01bf44af/r/tools/linuxlibs.R#L55]
with:
{color:#0747a6}binary_scan_ok <- !identical(tolower(Sys.getenv("LIBARROW_BINARY_SCAN", "false")), "false"){color}
{color:#c1c7d0}if (try_download(binary_url, libfile{color:#0747a6}, {color}{color}{color:#0747a6}scan_dates = binary_scan_ok{color}{color:#c1c7d0})) {{color}
This would allow automated builds to set the scan option, and then find and install the most recent daily build in lieu of an official binary build being in place.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)