Posted to issues@spark.apache.org by "Mingyu Kim (JIRA)" <ji...@apache.org> on 2014/05/16 13:14:51 UTC
[jira] [Commented] (SPARK-1154) Spark fills up disk with app-* folders
[ https://issues.apache.org/jira/browse/SPARK-1154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13999562#comment-13999562 ]
Mingyu Kim commented on SPARK-1154:
-----------------------------------
I looked at the commit, and it seems to delete app-* directories based on the last modification time of the directory itself. Because a directory's modification time changes only when a child is added or removed, this may wipe out the app-* directory of a running Spark application that has been running for longer than the TTL (unless new files or jars are added to the app directory every once in a while). I believe it should instead check the latest modification time across all descendants of the app-* directory when deciding whether to delete it. Am I mistaken?
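A minimal sketch of the descendant-based check suggested above, written in Python for brevity. It is not Spark's actual cleanup code: the names `newest_mtime` and `is_stale` and the `ttl_seconds` parameter are hypothetical, chosen only to illustrate the idea.

```python
import os
import time


def newest_mtime(root):
    """Return the most recent modification time of root or any descendant."""
    newest = os.path.getmtime(root)
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            try:
                entry_mtime = os.lstat(os.path.join(dirpath, name)).st_mtime
            except OSError:
                # The entry disappeared while we were walking; skip it.
                continue
            newest = max(newest, entry_mtime)
    return newest


def is_stale(app_dir, ttl_seconds, now=None):
    """True if nothing under app_dir was modified within the last ttl_seconds."""
    if now is None:
        now = time.time()
    return now - newest_mtime(app_dir) > ttl_seconds
```

With this check, an application that keeps appending to existing files under its app-* directory would not be treated as stale, even though the directory's own modification time stays unchanged. An application that writes nothing at all for longer than the TTL would still be considered stale, so this narrows the problem rather than eliminating it.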
> Spark fills up disk with app-* folders
> --------------------------------------
>
> Key: SPARK-1154
> URL: https://issues.apache.org/jira/browse/SPARK-1154
> Project: Spark
> Issue Type: Improvement
> Components: Deploy
> Reporter: Evan Chan
> Assignee: Evan Chan
> Priority: Critical
> Labels: starter
> Fix For: 1.0.0
>
>
> The current version of Spark fills up the disk with many app-* folders:
> $ ls /var/lib/spark
> app-20140210022347-0597 app-20140212173327-0627 app-20140218154110-0657 app-20140225232537-0017 app-20140225233548-0047
> app-20140210022407-0598 app-20140212173347-0628 app-20140218154130-0658 app-20140225232551-0018 app-20140225233556-0048
> app-20140210022427-0599 app-20140212173754-0629 app-20140218164232-0659 app-20140225232611-0019 app-20140225233603-0049
> app-20140210022447-0600 app-20140212182235-0630 app-20140218165133-0660 app-20140225232802-0020 app-20140225233610-0050
> app-20140210022508-0601 app-20140212182256-0631 app-20140218165148-0661 app-20140225232822-0021 app-20140225233617-0051
> app-20140210022528-0602 app-20140213000014-0632 app-20140218165225-0662 app-20140225232940-0022 app-20140225233624-0052
> app-20140211024356-0603 app-20140213002026-0633 app-20140218165249-0663 app-20140225233002-0023 app-20140225233631-0053
> app-20140211024417-0604 app-20140213154948-0634 app-20140218172030-0664 app-20140225233056-0024 app-20140225233725-0054
> app-20140211024437-0605 app-20140213171810-0635 app-20140218193853-0665 app-20140225233108-0025 app-20140225233731-0055
> app-20140211024457-0606 app-20140213193637-0636 app-20140218194442-0666 app-20140225233124-0026 app-20140225233733-0056
> app-20140211024517-0607 app-20140214011513-0637 app-20140218194746-0667 app-20140225233133-0027 app-20140225233734-0057
> app-20140211024538-0608 app-20140214012151-0638 app-20140218194822-0668 app-20140225233147-0028 app-20140225233749-0058
> app-20140211193443-0609 app-20140214013134-0639 app-20140218212317-0669 app-20140225233208-0029 app-20140225233759-0059
> app-20140211195210-0610 app-20140214013332-0640 app-20140225180142-0000 app-20140225233215-0030 app-20140225233809-0060
> app-20140211213935-0611 app-20140214013642-0641 app-20140225180411-0001 app-20140225233224-0031 app-20140225233828-0061
> app-20140211214227-0612 app-20140214014246-0642 app-20140225180431-0002 app-20140225233232-0032 app-20140225234719-0062
> app-20140211215317-0613 app-20140214014607-0643 app-20140225180452-0003 app-20140225233239-0033 app-20140226032845-0063
> app-20140211224601-0614 app-20140214184943-0644 app-20140225180512-0004 app-20140225233320-0034 app-20140226033004-0064
> app-20140212022206-0615 app-20140214185118-0645 app-20140225180533-0005 app-20140225233328-0035 app-20140226033119-0065
> app-20140212022226-0616 app-20140214185851-0646 app-20140225180553-0006 app-20140225233354-0036 app-20140226033334-0066
> app-20140212022246-0617 app-20140214222856-0647 app-20140225181115-0007 app-20140225233402-0037 app-20140226033354-0067
> app-20140212043704-0618 app-20140214231312-0648 app-20140225181244-0008 app-20140225233409-0038 app-20140226033538-0068
> app-20140212043724-0619 app-20140214231434-0649 app-20140225182051-0009 app-20140225233416-0039 app-20140226033826-0069
> app-20140212043745-0620 app-20140214231542-0650 app-20140225183009-0010 app-20140225233426-0040 app-20140226034002-0070
> app-20140212044016-0621 app-20140214231616-0651 app-20140225184133-0011 app-20140225233432-0041 app-20140226034053-0071
> app-20140212044203-0622 app-20140214233016-0652 app-20140225184318-0012 app-20140225233439-0042 app-20140226034234-0072
> app-20140212044224-0623 app-20140214233037-0653 app-20140225184709-0013 app-20140225233447-0043 app-20140226034426-0073
> app-20140212045034-0624 app-20140218153242-0654 app-20140225184844-0014 app-20140225233526-0044 app-20140226034447-0074
> app-20140212045119-0625 app-20140218153341-0655 app-20140225190051-0015 app-20140225233534-0045
> app-20140212173310-0626 app-20140218153442-0656 app-20140225232516-0016 app-20140225233540-0046
> This problem is particularly bad if you have a whole bunch of fast jobs.
> What makes the problem worse is that any jars for jobs are downloaded into the app-* folder, which fills up the disk particularly fast.
> I would like to propose two things:
> 1) Spark should have a cleanup thread (or actor) which periodically removes old app-* folders. This should not be the responsibility of people deploying Spark.
> 2) The download of jars should not go to each app-* folder. This wastes a huge amount of space because most jobs use the same jars. Maybe I can open a separate ticket for this.
--
This message was sent by Atlassian JIRA
(v6.2#6252)