Report PDF
on
Warehouse Robot
Bachelor of Technology
in
Computer Engineering
by
Mr. Faisal Alam
Department of Computer Engineering
Dated…………
Declaration
The work presented in the project entitled “Warehouse Robot”, submitted to the Department of Computer
Engineering, Zakir Husain College of Engineering and Technology, Aligarh Muslim University,
Aligarh, for the award of the degree of Bachelor of Technology in Computer Engineering, during the
session 2016-17, is my original work. I have neither plagiarized nor submitted the same work for the
award of any degree.
Date: (Signature)
Place: Arpit Varshney
(Signature)
Khan Saad Bin Hasan
Certificate
This is to certify that the Project Report entitled “Warehouse Robot”, being submitted by “student
name(s)”, in partial fulfillment of the requirements for the award of the degree of Bachelor of
Technology in Computer Engineering, during the session 2016-17, in the Department of Computer
Engineering, Zakir Husain College of Engineering and Technology, Aligarh Muslim University
Aligarh, is a record of the candidate’s own work carried out by him under my (our) supervision and
guidance.
Faisal Alam
Assistant Professor
Department of Computer Engineering
ZHCET, AMU, Aligarh
Table of Contents
Abstract 6
List of Figures 7
List of Tables 8
Chapter 1 Introduction 9
1.1 Motivation 10
1.2 Objectives and Scope 11
1.3 Organization
5.3.1 YOLOV3 25
5.3.2 How It Works 26
5.3.3 YOLOV3 Architecture[33] 26
5.4 Transfer Learning 27
5.4.1 Dataset Preparation
5.5 Object Tracking 28
5.5.1 Path Planning 28
5.5.1.2 A Star Algorithm 29
References 31
ABSTRACT
In recent times, we have seen a huge number of robots being used in warehouses to automate mundane
tasks. This helps reduce operating costs and makes warehouses safer and more efficient. It also takes
the burden off human workers and lets them focus on more creative tasks. However, these robots are not
always intelligent and hence cannot be used in settings other than the ones they were built for.
Intelligent robots are available but are too costly for most warehouses. We are trying to build a robot
that can assist us in transferring goods from one place to another within a storage facility, which will
also help with keeping account of the goods. We want the robot to be autonomous so that we reduce the
amount of workforce needed. Backup systems are needed to ensure the safety of the goods, other robots,
and people. The robot must also be cheap and must be programmable to do multiple tasks if needed.
LIST OF FIGURES
LIST OF TABLES
Chapter-1
Introduction
1.1 Motivation
We have seen the rise of large warehouses in modern times; especially with the growth of online retail
and e-commerce, the size and number of warehouses has grown considerably. Not long ago, warehouses used
manual labor for sorting, managing inventories, and transporting goods, and also for dangerous jobs like
handling hazardous substances, climbing to high places, and entering dangerous areas within warehouses.
This incurred a huge cost on warehouse owners and a huge human cost as well: warehouses could be
dangerous for the workers, and labour was made to do repetitive tasks while their human intelligence
could have been applied to more creative problem solving.
In recent times, however, this has changed considerably and continues to do so. Many companies like
Alibaba and Amazon have robots transporting their goods autonomously with little human supervision.
Many companies now build robots that can move through warehouses vertically, and these new robots are
helping redesign traditional warehouses to provide more compact and efficient ways to store products.
This results in less space being used and potential savings on expensive real estate.
It has become easier in recent times to acquire large fleets of autonomous robots that can do the heavy
lifting, as well as robots that can help with inventory and product management. These robots are
autonomous and can also recharge themselves. This can potentially replace a large percentage of the
warehouse’s workforce, resulting in huge savings and sparing employees from potential harm. Also, unlike
humans, these robots can work for hours on end without tiring and make very few errors. Some studies
suggest that the initial investment in these robots can be recovered within a year or two.
However, there is a tradeoff between cheap robots and robots that are somewhat intelligent. Robots that
are cheap can usually do a limited set of tasks efficiently but will fail when given other tasks. Such
robots use very simple sensors and actuators, which keeps them cheap, but it also makes them very task
specific and in many cases warehouse specific. Hence, warehouses must either ask the robot supplier to
provide robots tailored to their warehouse or build warehouses that suit a particular robot. This is not
suitable for small or old warehouses, since small warehouses might not have enough money to buy specific
robots and old warehouses might not work with new robots.
The other case is that of intelligent robots, which can adapt to different situations and can be made to
do different tasks via programming or software. However, to make even mildly intelligent robots we
require expensive sensors and actuators like LIDARs, infrared sensors, etc. This again might not be
suitable for most warehouses, since the cost blows up when buying a large number of robots. Hence, we
have to find a balance between the two approaches: whether to buy cheap robots that cannot generalize
to many tasks, or robots that can do many tasks but are expensive.
We aim to provide a middle path between the two approaches. Our aim is to build a cheap robot that can
generalize to multiple tasks with little modification. This can help small warehouses as well as old
ones. It will also let warehouses use the same robots for different tasks; hence, they can move robots
around within their warehouses and switch old robots to smaller tasks if needed. We use cheaper sensors,
and alternate sensors wherever possible.
This does lead to another set of problems: the robot is unable to get an accurate estimate of its
position, and we cannot deal with dynamic or fast-moving objects or sudden changes in the environment.
However, this is not always needed in a warehouse-like setting, where the surroundings may not be very
dynamic and exact positions are not always required.
Our first approach was to use Visual SLAM with multiple cameras mounted on the robot providing a
wide-angle view. The images from these cameras can be stitched together and used for Visual SLAM.
However, we were unable to complete this, since we had low computational power onboard and the lag
involved in sending streams to a more powerful computer was far more than was desirable. Hence, we
had to abandon this approach.
We then decided to take the cameras off the robot and use an external camera connected to another
computer with higher computational power. This should work well within a warehouse, since we can mount
cameras on the ceiling or use the streams from security cameras. We detect the robot in the external
camera’s view, compute a path for it using search and planning algorithms, and issue movement commands
based on that path.
Currently, we are building a backup system using ultrasonic sensors and servo motors. We are also
exploring reinforcement learning approaches so that we do not have to explicitly program how the robot
should behave. We also plan to add obstacle avoidance and object detection.
1.3 Organization
Chapter 1 gives an introduction to the problem, our motivation to solve it, and the scope of the solution.
Chapter 2 discusses various approaches that have been used to solve similar problems.
Chapter 3 gives a brief description of the various tools and technologies we have used.
Chapter 4 describes how we have assembled the robot and built the circuit.
Chapter 5 discusses in detail our approach to solving the problem and the various algorithms used.
Chapter 6 outlines the problems we are currently facing and our possible future course of action.
Chapter-2
Literature Review
Gmapping[15] and Hector SLAM[16] are techniques used to create 2D maps, which may be useful but are
not sufficient to get a comprehensive map of the world. Cartographer[17], on the other hand, has a lot
of advantages: it is fast, works in real time, has good documentation available, is easy to set up, and
shows promising results. Apart from LIDAR-based SLAM, vision-based SLAM techniques are also available.
They can work even with monocular cameras, in real time, as shown in [18][19][20]. These approaches
seem promising for building a 3D map of the environment. They promise to be less resource intensive
than LIDAR-based techniques and can even run on mobile phones.
We found the work of Wayve[25], which relies heavily on machine learning techniques. They train their
car in simulation, transfer this learning to the real world, and fine-tune as needed[26][27]. Waymo
uses its custom-built “Carcraft”[28]; TrueVision.ai[29] and CARLA[30] also provide very good simulators.
For visualization, even though RViz is very good and customizable, XVIZ is also available and is
tailored to self-driving cars[31].
iRobot[34] was one of the first companies to come up with innovative commercial floor cleaners. The
Roomba 400 series, similar to the 600 series[38], was one of their first robots; it used very cheap
sensors and a naive model based on moving randomly and hoping the robot would eventually cover the
entire area. It became an instant hit. The iRobot Scooba 450[39] uses iAdapt Responsive Navigation
Technology, a software system with sensors that allows the robot to cover the entire floor section with
multiple passes. The Braava 380t[40], on the other hand, uses the NorthStar Navigation System, which
comes with a stand-alone battery-operated Cube that allows the device to determine its location in
every room and automatically build a map to ensure an efficient cleaning route. The robot can then make
a single pass over every area within its generated map, and once it is done cleaning, it is programmed
to return to its starting position.
The Dyson[36] 360 Eye[37] uses a 360-degree camera to build a map of the room and navigate
efficiently.
2.2.4 Autostore[41]
AutoStore is perhaps the most distinctive of these robot concepts. It is cheap, efficient, and saves a
lot of money. The idea is to have a vertical storage system, thereby utilizing the whole space and not
leaving gaps between shelves. The robots move within the grooves of a grid on top of the vertical
storage, so the need for expensive sensors is minimized. The robots can pick up goods from beneath them
and transport them to the proper exit station.
Chapter-3
Tools and Technologies Used
3.1 Languages and framework
● Keras is an open-source neural-network library written in Python. It is capable of running on
top of TensorFlow, Microsoft Cognitive Toolkit, R, Theano, or PlaidML. Designed to enable fast
experimentation with deep neural networks, it focuses on being user-friendly, modular, and
extensible. We have used Keras to build the model for object detection.
● TensorFlow is a free and open-source software library for dataflow and differentiable
programming across a range of tasks. It is a symbolic math library and is also used for machine
learning applications such as neural networks. We have used TensorFlow for transfer learning.
● Socket programming involves writing programs that enable processes to communicate with each
other across a computer network. We have used Python sockets to provide the video stream and to
offload computational tasks to the remote computer.
● Python is an interpreted, high-level, general-purpose programming language.
● OpenCV is a library of programming functions mainly aimed at real-time computer vision. We have
used it for computer vision tasks like object detection and colour tracking.
● NumPy has been used for large scientific calculations, like the convolution operation, sliding
windows, etc.
● pygame[14] is a free and open-source Python library for making multimedia applications like
games, built on top of the excellent SDL library. Like SDL, pygame is highly portable and runs on
nearly every platform and operating system.
3.4 Materials Used
● BO gear motors[1]: These are very popular motors among hobbyists. They have internal gears
which facilitate controlled application of power.
● L298N-based motor drivers[2]: This is a very versatile motor driver; it uses the popular L298
motor driver IC and has an onboard 5V regulator which can supply an external circuit. It can
control up to 4 DC motors, or 2 DC motors with direction and speed control.
● Power banks[3]: We use 10000 mAh power banks to run the motors as well as to power the Arduino
and Raspberry Pi. Current is supplied to the motors and the Raspberry Pi via 2.1 A ports, and to
the Arduino via a 1.5 A port.
● Arduino Uno[12]: For low-level computing we have used the Arduino Uno, since it has a large user
base and a supportive community and works with a wide variety of sensors and actuators, making it
ideal for our use case.
Fig 3.4 Arduino Uno Fig 3.5 Raspberry pi 3B
Fig 3.6 MG90S Servo Motors Fig 3.7 HCSR04 Ultrasonic Distance Sensor
● HC-SR04[9]: This is a cheap and surprisingly accurate sensor widely used to measure distances.
We use it in a backup system that protects the robot from collisions. It is mounted on the servos,
giving us a 360-degree view.
● Cameras: The Rpi cam[10] has been used since it is made specifically for the Raspberry Pi and
hence is optimized for performance; however, only one such camera can be attached to the rpi, so
we have also used a Logitech C270 webcam[7].
● A normal-sized exam board[5] has been used as the chassis of the robot, jumper wires[4] are
used to make connections between sensors and actuators, and RW 002 off-road wheels[6] are the
wheels we have used.
Fig 3.8 Raspberry pi camera Fig 3.9 Logitech Webcam Fig 3.10 Offroad Wheels
Chapter-4
Assembly and Circuit Diagram
● Wheels are screwed onto the motors, which are fixed to the bottom of the board. 4 such motors
along with 4 wheels have been used, and 2 motor drivers have been fitted to drive the 4 motors.
● The motors are connected to the motor drivers, and wires are taken to the upper part of the
chassis to connect with the Arduino. The Arduino is connected to the Raspberry Pi to enable I2C
communication.
● Cameras are connected to the Raspberry Pi. The rpi connects to WiFi via a mobile hotspot, and
the remote workstation connects to the same hotspot so that the remote computer and the rpi are on
the same network.
● The Arduino is also connected to 2 servo motors and 2 ultrasonic sensors.
● The motor drivers are powered by the same 2.1 A supply, connected in parallel. One power bank
is used to power the motors only, since on starting, the motors drew more current than the power
bank could handle, causing it to shut off. Another power bank is used to power the Arduino and
rpi.
● The circuit diagram, along with the Arduino pin assignments, is shown in the figures.
Fig 4.1 Fritzing Diagram
Fig 4.2 How the networking is done
Table-4.2 Arduino Pin Layout
Chapter-5
Implementation Details
5.1 Problem Solving Approaches
There are two approaches that we are considering to solve this problem. Whichever approach we take, we
must implement two things before we can proceed.
5.1.4 Video Streaming with a Wide Field of View
We need to stream the video from the robot to a computer, since the algorithms may require more
computational power than the rpi can provide; hence we are using a workstation. The field of view of
the stream must be as large as possible, since that helps in better decision making; SLAM algorithms
also require a field of view of ~120 degrees. We also need a stream with as little delay as possible.
We have tried different things to achieve the above, but with little success: we tried image stitching
to combine images from different cameras, but it did not work as expected, and the lag due to video
streaming is much more than we would like.
We are also implementing a backup system that can save the vehicle from any untoward incident if things
go wrong. The Arduino can always act as the backup system: it should be able to stop the vehicle when
it detects an obstacle and, if needed, drive the robot to safety.
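The safety check at the heart of this backup system can be sketched as follows. The threshold value and the command names are assumptions for illustration; on the real robot this logic runs on the Arduino in C++:

```python
# assumed safety margin in centimetres (not the project's tuned value)
SAFETY_DISTANCE_CM = 30


def backup_check(front_cm, rear_cm, moving_forward):
    """Return a safe command given front/rear ultrasonic readings."""
    if moving_forward and front_cm < SAFETY_DISTANCE_CM:
        # obstacle ahead: back away only if the rear is clear, else halt
        return "reverse" if rear_cm >= SAFETY_DISTANCE_CM else "stop"
    if not moving_forward and rear_cm < SAFETY_DISTANCE_CM:
        # obstacle behind while reversing: halt
        return "stop"
    return "continue"
```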
5.2.1 Arduino
● We wrote the code to move the robot forward, backward, right, and left. These are wrapped in
functions, so calling right or left moves the robot accordingly.
● We wrote the code to rotate the servo motors and to get distance data from the HC-SR04 sensors.
The HC-SR04 reading has to be corrected for temperature, because the speed of sound depends on the
temperature of the medium; we have used the table shown to calculate the effect of temperature.
● A backup algorithm has also been implemented. It uses the servo motors and ultrasonic sensors
to get the distances in front of and behind the robot. If an obstacle is detected closer than the
safety distance, the robot is stopped and may backtrack; if another obstacle is detected while
backtracking, it is stopped again. It then looks around by rotating its servo, finds a direction
with no obstacle, rotates itself in that direction, and moves along it, thus keeping the robot
safe.
● The robot can be given commands by the rpi via I2C communication. These commands are used to
decide where the robot should move, and the appropriate motor commands are issued.
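The temperature correction mentioned above follows from the speed of sound in air, approximately 331.3 + 0.606·T m/s for T in degrees Celsius. A sketch of the conversion (the function name is ours; on the robot this runs on the Arduino):

```python
def hcsr04_distance_cm(echo_us, temp_c=20.0):
    """Convert an HC-SR04 echo time (microseconds) to distance in cm.

    The speed of sound varies with air temperature, so the same echo
    time maps to different distances on a cold vs. a warm day.
    """
    # speed of sound in m/s, converted to cm per microsecond
    speed_cm_per_us = (331.3 + 0.606 * temp_c) * 100 / 1e6
    # the pulse travels to the obstacle and back, so halve the product
    return echo_us * speed_cm_per_us / 2
```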
5.2.2 Raspberry Pi
● Sends the stream to the workstation over a UDP connection.
● Receives commands from the workstation and relays them to the Arduino via I2C.
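The command-relay path on the rpi can be sketched as below. The one-byte command encoding, the port, and the I2C address are assumptions; on the real robot the `write` callable would be an `smbus.SMBus(1).write_byte` bound to the Arduino's address:

```python
import socket

# assumed one-byte encoding of movement commands
COMMANDS = {b"F": 1, b"B": 2, b"L": 3, b"R": 4, b"S": 0}


def relay_command(packet, write):
    """Translate a UDP packet into an I2C byte and hand it to `write`."""
    code = COMMANDS.get(packet)
    if code is not None:
        write(code)
    return code


def serve(port=5005, write=print):
    """Listen for workstation commands over UDP and relay each one."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind(("0.0.0.0", port))
    while True:
        packet, _ = sock.recvfrom(16)
        relay_command(packet, write)
```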
5.2.3 Workstation
● The workstation establishes a UDP connection with the rpi and receives the stream.
● The stream can be shown in a pygame window, and the user can give commands via the keyboard:
the up, down, right, and left arrows, and space to stop.
● Object detection is implemented in a separate module, which is called to detect objects.
● Images from the different cameras can also be pasted together and shown.
The relevant algorithms and techniques from the above are detailed below.
5.3.1 YOLOV3
You only look once (YOLO) at an image to predict what objects are present and where they are, using a
single convolutional network. Compared to its previous version, YOLOv3 uses a few tricks to improve
training and increase performance, including multi-scale predictions, a better backbone classifier, etc.
5.3.2 How It Works
Prior detection systems repurpose classifiers or localizers to perform detection: they apply the model
to an image at multiple locations and scales, and high-scoring regions of the image are considered
detections. YOLO uses a totally different approach. It applies a single neural network to the full
image. This network divides the image into regions and predicts bounding boxes and probabilities for
each region. These bounding boxes are weighted by the predicted probabilities.
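The "weighted by the predicted probabilities" step can be illustrated with a toy post-processing function. This is an illustrative sketch, not the project's code: each detection row holds a box, an objectness score, and per-class probabilities, and a box is kept only when objectness times its best class probability clears a threshold:

```python
def filter_detections(rows, conf_threshold=0.5):
    """Keep YOLO-style detections whose combined score passes the threshold.

    rows: iterable of (x, y, w, h, objectness, [class probabilities...]).
    Returns a list of (x, y, w, h, best_class_index, score).
    """
    kept = []
    for x, y, w, h, obj, class_probs in rows:
        # pick the most likely class for this box
        best_class = max(range(len(class_probs)), key=lambda i: class_probs[i])
        # weight the box by objectness times the class probability
        score = obj * class_probs[best_class]
        if score >= conf_threshold:
            kept.append((x, y, w, h, best_class, score))
    return kept
```

A full pipeline would follow this with non-maximum suppression to drop overlapping boxes.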
5.4 Transfer Learning
Transfer learning is a useful way to quickly retrain YOLOv3 on new data without retraining the entire
network. We accomplish this by starting from the official YOLOv3 weights and setting the .requires_grad
field to False for each layer whose gradients we do not want to calculate and optimize. We have
identified the classes present in the dataset and are currently preparing our own dataset, which
includes shoes, wheels, boxes, etc.
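The freezing step described above can be sketched with stand-in parameter dicts. In the actual PyTorch-based YOLOv3 code one would iterate over `model.named_parameters()` and set each parameter's `.requires_grad`; here plain dicts stand in so the idea is runnable without the framework:

```python
def freeze_all_but(params, trainable_prefixes):
    """Freeze every parameter whose name lacks a trainable prefix.

    params: mapping of parameter name -> parameter-like dict.
    Only parameters under the given prefixes (e.g. the detection head)
    keep requires_grad=True and are updated by the optimizer.
    """
    for name, param in params.items():
        param["requires_grad"] = any(
            name.startswith(prefix) for prefix in trainable_prefixes
        )
    return params
```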
5.5 Object Tracking
Object tracking is the process of locating a moving object (or multiple objects) over time using a
camera. This technique allows us to track a particular object on the basis of its colour. We use it to
locate the robot in the specified space, so that the robot can move along an optimized path from source
to destination.
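The colour-tracking step can be sketched in pure NumPy: threshold the frame to a binary mask of the robot's marker colour, then take the mask centroid as the robot's position. In the project this is done with OpenCV (e.g. inRange and moments); the HSV bounds here are assumptions:

```python
import numpy as np


def track_color(hsv, lower, upper):
    """Return the (row, col) centroid of pixels in the HSV range, or None."""
    # boolean mask: True where all three channels fall inside the bounds
    mask = np.all((hsv >= lower) & (hsv <= upper), axis=-1)
    ys, xs = np.nonzero(mask)
    if len(xs) == 0:
        return None  # marker colour not visible in this frame
    # centroid of the matching pixels approximates the robot's position
    return float(ys.mean()), float(xs.mean())
```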
Fig 5.5 Optimized path Between Source and Destination
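The path between source and destination shown above comes from the planning step (the A* algorithm listed in Section 5.5.1.2). A minimal grid-based A* sketch, assuming a 4-connected occupancy grid and a Manhattan-distance heuristic:

```python
import heapq


def astar(grid, start, goal):
    """A* over a grid of 0 (free) / 1 (obstacle) cells.

    Returns the list of cells from start to goal, or None if unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan
    # heap entries: (f = g + h, g, cell, path so far)
    open_set = [(h(start), 0, start, [start])]
    best_g = {start: 0}
    while open_set:
        _, g, cur, path = heapq.heappop(open_set)
        if cur == goal:
            return path
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (cur[0] + dr, cur[1] + dc)
            if (0 <= nxt[0] < rows and 0 <= nxt[1] < cols
                    and grid[nxt[0]][nxt[1]] == 0
                    and g + 1 < best_g.get(nxt, float("inf"))):
                best_g[nxt] = g + 1
                heapq.heappush(open_set, (g + 1 + h(nxt), g + 1, nxt, path + [nxt]))
    return None
```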
Chapter-6
Current Challenges and Future Plan
● Implementing a backup algorithm that can help us avoid accidents is one of our top priorities.
We have implemented the algorithm and tested it with the Arduino only; testing with the workstation
and rpi, along with the planning algorithm, remains to be done.
● Lag in the stream: We have tried many different streaming techniques, but most result in
significant lag: up to 10 s with the Logitech camera and less than 1 s with the rpi cam. There
seems to be a tradeoff between video quality, the associated lag, and the smoothness of the video;
we wish to find the optimal balance between these.
● The field of view of each of our cameras is no more than 60 degrees, while we require around
120 degrees; hence we resorted to image stitching, which did not turn out well. We would like to
explore other ways to achieve this.
● We wish to improve our object detection model and to collect data to make it work better on our
own data. Our main objective is to detect obstacles, for which we are currently exploring machine
learning based techniques, such as YOLO object detection, as well as classical techniques, such as
Haar cascades.
● We plan to collect video of the robot moving, so we can use it to train a reinforcement learning
model. This would help us generalize the model without explicitly programming it for many scenarios.
● We are working on implementing the Kalman filter algorithm. It requires data from multiple
sensors, including wheel encoders and inertial measurement units. We have been able to get data
from both; however, mounting them on the robot is a challenge we would like to address.
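The fusion idea behind this plan can be shown with a toy one-dimensional Kalman filter: predict position from wheel-encoder velocity, then correct with a noisy position measurement. The noise values here are illustrative, not tuned for the robot:

```python
def kalman_step(x, p, velocity, z, dt=0.1, q=0.01, r=1.0):
    """One 1-D Kalman predict+update step.

    x, p: current position estimate and its variance.
    velocity: wheel-encoder velocity used for dead reckoning.
    z: noisy position measurement (e.g. from the overhead camera or IMU).
    q, r: process and measurement noise variances (illustrative values).
    Returns (new_estimate, new_variance).
    """
    # predict: dead-reckon from the encoder; uncertainty grows by q
    x_pred = x + velocity * dt
    p_pred = p + q
    # update: blend in the measurement according to the Kalman gain
    k = p_pred / (p_pred + r)
    x_new = x_pred + k * (z - x_pred)
    p_new = (1 - k) * p_pred
    return x_new, p_new
```

Each update shrinks the variance, so the estimate comes to trust the fused history more than any single noisy reading.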
REFERENCES
simulation-facilities/537648/ (Aug-10-2019)
[29] https://www.truevision.ai/ (Aug-10-2019)
[30] Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., & Koltun, V. (2017). CARLA: An open urban driving
simulator. arXiv preprint arXiv:1711.03938 .
[31] https://github.com/uber/xviz (Aug-10-2019)
[32] https://github.com/AlexeyAB/darknet (Sep-10-2019)
[33] Redmon, Joseph, and Ali Farhadi. "Yolov3: An incremental improvement." arXiv preprint arXiv:1804.02767
(2018).
[34] Irobot.com, 'iRobot Corporation: We Are The Robot Company', 2015. [Online]. Available:
http://www.irobot.com/.
[35] Neato, 'Neato Robotics | Smartest, Most Powerful, Best Robot Vacuum', 2015. [Online]. Available:
http://www.neatorobotics.com/.
[36] Dyson.com, 'Latest Dyson Vacuum Cleaner Technology | Dyson.com', 2015. [Online]. Available:
http://www.dyson.com/vacuum-cleaners.aspx.
[37] Dyson 360 Eye™ robot, 'Dyson 360 Eye™ robot', 2015. [Online]. Available: https://www.dyson360eye.com/.
[38] Irobot.com, 'iRobot Corporation: We Are The Robot Company', 2015. [Online]. Available:
https://www.irobot.in/600-series.aspx
[39] Irobot.com, 'iRobot Corporation: We Are The Robot Company', 2015. [Online]. Available:
https://store.irobot.com/default/scooba-floor-scrubbing/irobot-scooba-450/S450020.html?cgid=us
[40] Irobot.com, 'iRobot Corporation: We Are The Robot Company', 2015. [Online]. Available:
https://store.irobot.com/default/braava-floor-mopping-irobot-braava-floor-mopping/B380020.html
[41] bastiansolutions, (accessed March 17, 2020) https://www.bastiansolutions.com/blog/what-is-autostore/
[42] ssi-schaefer, (accessed March 17, 2020)
https://www.ssi-schaefer.com/en-us/products/conveying-transport/automated-guided-vehicles/fahrerloses-transportsystem-weasel-53020
[43] exotec, (accessed March 17, 2020) https://www.exotec.com/en/
[44] amazon-robotics, (accessed March 17, 2020) https://www.amazonrobotics.com/#/