Dhiraj Rai · How to pass model objects as arguments to a UDF in pyspark · UDFs are inevitable when we want to do distributed processing of our data using pyspark. · 4 min read · Mar 3, 2024
Dhiraj Rai · How to connect to EC2 with PuTTY · Step 1a: To set up an EC2 instance, navigate to the AWS EC2 service and click on launch instances. · 3 min read · Apr 12, 2021
Dhiraj Rai · How to install custom packages to AWS Lambda — with EC2 · Lambda is an important service offered by AWS which comes in handy while building fully automated and serverless pipelines in the AWS cloud. · 5 min read · Apr 11, 2021
Dhiraj Rai · How to install custom packages to AWS Lambda · Lambda is a very useful service offered by AWS, but it does not contain all the Python packages that we normally use in Python scripting. · 3 min read · Apr 7, 2021
Dhiraj Rai · Jaya Optimization Algorithm · The Jaya algorithm is a metaheuristic which is capable of solving both constrained and unconstrained optimization problems. It is a… · 4 min read · Nov 14, 2018
Dhiraj Rai · Logistic Regression in Spark ML · The intent of this blog is to demonstrate binary classification in pySpark. The various steps involved in developing a classification… · 8 min read · Nov 2, 2018
Dhiraj Rai · Feature Engineering in pyspark — Part I · 8 min read · Oct 29, 2018