Error: spark-shell, falling back to uploading libraries under SPARK_HOME

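I am trying to start spark-shell on Amazon Hadoop (EMR), but every time I get the following message, and I don't know how to fix it or which configuration is missing (spark.yarn.jars, spark.yarn.archive).
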
spark-shell --jars /usr/share/aws/emr/ddb/lib/emr-ddb-hadoop.jar
Setting default log level to "WARN".
To adjust logging level use sc.setLogLevel(newLevel).
16/08/12 07:47:26 WARN Utils: Service 'SparkUI' could not bind on port 4040. Attempting port 4041.
16/08/12 07:47:28 WARN Client: Neither spark.yarn.jars nor spark.yarn.archive is set, falling back to uploading libraries under SPARK_HOME.

Thanks!

Error 1

I tried to run a very simple SQL query, like this:

val sqlDF = spark.sql("SELECT col1 FROM tabl1 limit 10")
sqlDF.show()

WARN YarnScheduler: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources

Error 2

Then I tried to run a simple Scala script taken from: https://blogs.aws.amazon.com/bigdata/post/Tx2D93GZRHU3TES/Using-Spark-SQL-for-ETL

import org.apache.hadoop.io.Text;
import org.apache.hadoop.dynamodb.DynamoDBItemWritable
import com.amazonaws.services.dynamodbv2.model.AttributeValue
import org.apache.hadoop.dynamodb.read.DynamoDBInputFormat
import org.apache.hadoop.dynamodb.write.DynamoDBOutputFormat
import org.apache.hadoop.mapred.JobConf
import org.apache.hadoop.io.LongWritable
import java.util.HashMap


var ddbConf = new JobConf(sc.hadoopConfiguration)
ddbConf.set("dynamodb.output.tableName", "tableDynamoDB")
ddbConf.set("dynamodb.throughput.write.percent", "0.5")
ddbConf.set("mapred.input.format.class", "org.apache.hadoop.dynamodb.read.DynamoDBInputFormat")
ddbConf.set("mapred.output.format.class", "org.apache.hadoop.dynamodb.write.DynamoDBOutputFormat")


var genreRatingsCount = sqlContext.sql("SELECT col1 FROM table1 LIMIT 1")

var ddbInsertFormattedRDD = genreRatingsCount.map(a => {
  var ddbMap = new HashMap[String, AttributeValue]()

  var col1 = new AttributeValue()
  col1.setS(a.get(0).toString)
  ddbMap.put("col1", col1)

  var item = new DynamoDBItemWritable()
  item.setItem(ddbMap)

  (new Text(""), item)
})

ddbInsertFormattedRDD.saveAsHadoopDataset(ddbConf)

scala.reflect.internal.Symbols$CyclicReference: illegal cyclic reference involving object InterfaceAudience
    at scala.reflect.internal.Symbols$Symbol$$anonfun$info$3.apply(Symbols.scala:1502)
    at scala.reflect.internal.Symbols$Symbol$$anonfun$info$3.apply(Symbols.scala:1500)
    at scala.Function0$class.apply$mcV$sp(Function0.scala:34)


This looks like just a warning, not an error. What problem are you actually running into? - Ram Ghadiyaram

1 Answer


It looks like the Spark UI is not up. Try starting spark-shell and checking that the SparkUI at localhost:4040 is running properly.
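
One way to run that check from the shell itself is a plain TCP probe. The snippet below is only a minimal sketch: `isPortOpen` is a hypothetical helper written for this illustration (it is not a Spark API), and it assumes the UI runs on the same host as the shell. Since the startup log in the question shows that 4040 was already taken and Spark retried on 4041, it probes both ports. In spark-shell, enter it with :paste.

```scala
import java.net.{InetSocketAddress, Socket}
import scala.util.Try

// Hypothetical helper (not part of Spark): returns true if a TCP connection
// to host:port succeeds within timeoutMs milliseconds, false otherwise.
def isPortOpen(host: String, port: Int, timeoutMs: Int = 1000): Boolean =
  Try {
    val socket = new Socket()
    try socket.connect(new InetSocketAddress(host, port), timeoutMs)
    finally socket.close()
  }.isSuccess

// The question's log reports a retry on 4041, so check both candidate ports.
Seq(4040, 4041).foreach { port =>
  println(s"localhost:$port reachable: ${isPortOpen("localhost", port)}")
}

// Assuming Spark 2.0 or later, the context can also report the address the
// UI actually bound to:
// sc.uiWebUrl.foreach(println)
```

A reachable port only shows that something is listening there; it does not prove that the listener is this session's Spark UI, so opening the address in a browser is still the more reliable check.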

