Block or report user
  • Cloudera
  • London, United Kingdom
  • Joined on Jun 1, 2011

Organizations

@apache @cloudera @OryxProject

Pinned repositories

  1. apache/spark

    Mirror of Apache Spark

    Scala 11.2k 10.5k

  2. OryxProject/oryx

    Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

    Java 975 264

  3. zxing/zxing

    Official ZXing ("Zebra Crossing") project home

    Java 11.6k 5.9k

962 contributions in the last year

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Mon Wed Fri

Contribution activity First pull request First issue Joined GitHub

December 2016

Created a pull request in apache/spark that received 9 comments

[SPARK-18678][ML] Skewed reservoir sampling in SamplingUtils

What changes were proposed in this pull request? Fix reservoir sampling bias for small k. An off-by-one error meant that the probability of replace…

Seeing something unexpected? Take a look at the GitHub profile guide.