About 50 results
Open links in new tab
  1. Comparison operator in PySpark (not equal/ !=) - Stack Overflow

    Aug 24, 2016 · Comparison operator in PySpark (not equal/ !=) Asked 9 years, 5 months ago Modified 1 year, 11 months ago Viewed 165k times

  2. pyspark - How to use AND or OR condition in when in Spark - Stack …

    107 pyspark.sql.functions.when takes a Boolean Column as its condition. When using PySpark, it's often useful to think "Column Expression" when you read "Column". Logical operations on PySpark …

  3. PySpark: multiple conditions in when clause - Stack Overflow

    Jun 8, 2016 · Very helpful observation when in pyspark multiple conditions can be built using & (for and) and | (for or). Note:In pyspark t is important to enclose every expressions within parenthesis () that …

  4. Show distinct column values in pyspark dataframe - Stack Overflow

    With pyspark dataframe, how do you do the equivalent of Pandas df['col'].unique(). I want to list out all the unique values in a pyspark dataframe column. Not the SQL type way (registertemplate the...

  5. cannot resolve column due to data type mismatch PySpark

    Mar 12, 2020 · cannot resolve column due to data type mismatch PySpark Asked 5 years, 10 months ago Modified 4 years, 10 months ago Viewed 39k times

  6. python - Convert pyspark string to date format - Stack Overflow

    Jun 28, 2016 · Convert pyspark string to date format Asked 9 years, 7 months ago Modified 2 years, 5 months ago Viewed 523k times

  7. python - PySpark: "Exception: Java gateway process exited before ...

    I'm trying to run PySpark on my MacBook Air. When I try starting it up, I get the error: Exception: Java gateway process exited before sending the driver its port number when sc = SparkContext() is

  8. Newest 'pyspark' Questions - Stack Overflow

    How to run Pyspark UDF separately over dataframe groups Grouping a Pyspark dataframe, applying time series analysis UDF to each group SOLVED See below I have a Pyspark process which takes a …

  9. pyspark: ValueError: Some of types cannot be determined after inferring

    pyspark: ValueError: Some of types cannot be determined after inferring Asked 9 years, 2 months ago Modified 1 year, 8 months ago Viewed 142k times

  10. Manually create a pyspark dataframe - Stack Overflow

    Sep 16, 2019 · 8 This answer demonstrates how to create a PySpark DataFrame with createDataFrame, create_df and toDF.