• Cursos
  • Compilador de Código
  • Debatir
  • Precios
  • Teams
Menu
0

Pyspark glom()

I do understand that it returns RDD coalescing all elements within each partition into a list. What happens when we don’t specify the num of partition, is there is a default? where do we actually use it?

actionpysparktransformationdataengineering
18th Jun 2024, 3:28 AM
Chethana
Chethana - avatar
1 Respuesta
+ 1
Have you tried looking at the documentation? The glom() method does not have any arguments. https://spark.apache.org/docs/latest/api/JUMP_LINK__&&__python__&&__JUMP_LINK/reference/api/pyspark.RDD.glom.html https://stackoverflow.com/questions/24996302/setting-sparkcontext-for-pyspark https://stackoverflow.com/questions/65489387/whats-the-meaning-of-num-slices-parameter-in-sc-parallelize
18th Jun 2024, 4:20 PM
Tibor Santa
Tibor Santa - avatar

¿Tienes a menudo preguntas como esta?

Aprende gratis de forma más eficaz

  • Introducción a Python

    7,1M de estudiantes

  • Introducción a Java

    4,7M de estudiantes

  • Introducción a C

    1,5M de estudiantes

  • Introducción a HTML

    7,5M de estudiantes

Ver todos los cursos
En tendencia hoy
How to do a responsive page?
1 Votes
Running a python code
1 Votes
Can I make coding projects here and run them without sololearn pro, only in sololearn.
0 Votes
Hey I've done the C# and SQL beginner and intermediate, but still feel like there could be more... Is there advanced somewhere?
0 Votes
How create a new language ?
0 Votes
Is there any debugging practice here or not?
1 Votes
Ai in future
1 Votes
Hola
0 Votes
How To Enable Disable Divs?
0 Votes
Beginner question
0 Votes