scala – Page 2 – Row Coding

How to restrict actor messages to specific types?

November 27, 2023 by Tarik

Then you’d have to encode the message type into the Actor ref, which would drastically decrease the value of something like the ActorRegistry. Also, with powerful mechanics like “become” (which is fundamental to the actor model) typing the messages is less valuable. Since Akka doesn’t leak memory when a message is not matched to the … Read more

Spray, Akka-http and Play, Which is the best bet for a new HTTP/REST project

November 27, 2023 by Tarik

Spray is production ready, but the development team (Mathias Doenitz) works for Typesafe on Akka-http now. The status of Akka-http is “development preview”. There are vague promises of a full release “within a few months”, but nothing you can take to the bank. Edited 29-July-2015: The status of Akka-HTTP is now “release candidate” with version … Read more

Why does Spark fail with “Detected cartesian product for INNER join between logical plans”?

November 27, 2023 by Tarik

You can triggers inner join after turning on the flag spark.conf.set(“spark.sql.crossJoin.enabled”, “true”) You also could also use the cross join. weights.crossJoin(input) or set the Alias as weights.join(input, input(“sourceId”)===weights(“sourceId”), “cross”) You can find more about the issue SPARK-6459 which is said to be fixed in 2.1.1 As you have already used 2.1.1 the issue should have … Read more

Kafka topic creation: Timed out waiting for a node assignment

November 26, 2023 by Tarik

If you’re running Kafka in Docker (or similar) you need to configure the listeners correctly. This article describes it in detail. Here’s an example of a Docker Compose that you can use to access Kafka from your host machine. Disclaimer: I wrote the article 🙂

How to load 100 million records into MongoDB with Scala for performance testing?

November 26, 2023 by Tarik

Some tips : Do not index your collection before inserting, as inserts modify the index which is an overhead. Insert everything, then create index . instead of “save” , use mongoDB “batchinsert” which can insert many records in 1 operation. So have around 5000 documents inserted per batch. You will see remarkable performance gain . … Read more

How to define maven test-jar dependency in sbt

November 26, 2023 by Tarik

“org.apache.hbase” % “hbase” % “0.90.4” % “test” classifier “tests”

scala median implementation

November 26, 2023 by Tarik

Immutable Algorithm The first algorithm indicated by Taylor Leese is quadratic, but has linear average. That, however, depends on the pivot selection. So I’m providing here a version which has a pluggable pivot selection, and both the random pivot and the median of medians pivot (which guarantees linear time). import scala.annotation.tailrec @tailrec def findKMedian(arr: Array[Double], … Read more

scala parallel collections degree of parallelism

November 26, 2023 by Tarik

With the newest trunk, using the JVM 1.6 or newer, use the: collection.parallel.ForkJoinTasks.defaultForkJoinPool.setParallelism(parlevel: Int) This may be a subject to changes in the future, though. A more unified approach to configuring all Scala task parallel APIs is planned for the next releases. Note, however, that while this will determine the number of processors the query … Read more

How to obtain the symmetric difference between two DataFrames?

November 26, 2023 by Tarik

You can always rewrite it as: df1.unionAll(df2).except(df1.intersect(df2)) Seriously though this UNION, INTERSECT and EXCEPT / MINUS is pretty much a standard set of SQL combining operators. I am not aware of any system which provides XOR like operation out of the box. Most likely because it is trivial to implement using other three and there … Read more

How to run sbt multiple command in interactive mode as one command? [duplicate]

November 26, 2023 by Tarik

Within the sbt shell, use ; to chain commands: ;project XXX; assembly Calling from the command line, enclose individual commands with quotes: sbt “project XXX” assembly or enclose a whole chain in quotes: sbt “;project XXX; assembly” To call a task in subproject XXX from the context of another project in the shell: XXX/assembly