Queues and Service Systems Flashcards
What are queues?
Queues are the result of the mismatch between demand and supply.
Queues can be real and virtual.
For example:
- Queuing up to pay money at the supermarket
- Waiting on the phone when customer service put us on hold.
Queues when demand exceeds supply.
In this situation, queues will build up. So we want to find out:
- What is the length of the queue
- How long does the inventory need to wait to be processed
- What is the average time inventory needs to wait to be processed.
How to find the length of the queue?
Queue growth rate = Demand - Capacity
Length of queue at Time, T
= T * (Queue Growth Rate)
= T * (Demand - Capacity)
Time to serve customers arriving at time, T.
Time to serve Qth person in the queue
= Q / Capacity, where Q is the queue length.
Time to serve customer arriving at time, T:
By using the formula to find queue length, we conclude that time to serve the person arrive at time T:
= T * (Demand / Capacity - 1)
Average waiting time during a period T.
Average time to serve a unit
=1/2 * T * (Demand/Capacity - 1) , this T refers to the maximum period of the whole period.
2 factors that will determine average wait time:
- Implied utilization (Demand/Capacity) -> This is a ratio
- Duration of the period T.
Managing Peak Demand (How to reduce demand or increase capacity)
How to reduce demand without reducing revenue?
- Peak-load pricing:
- Charging more during the time period with highest demand. (Compensation) - Off-peak discount
- Offering a discount during a time period with low demand.
(BR)
Adjusting capacity
- Pre-processing strategy:
- Reducing the amount of work needed to process a customer during the peak time period by moving some of the work to an off-peak time.
This will result in less work per customer during peak time and the capacity to serve customers during that time is increased.
Does demand always exceeds capacity?
No, this situation do not last forever. Hence managing peak demand is only temporary.
Queues when demand is lower than supply and demand/service rates are variable - One server
Only one operator/person providing service.
Demand is lower than supply because we are using the flow rate/capacity. At one time, someone might still be waiting but we are looking from the perspective of the process -> What is the process capacity.
As long as the capacity is greater than the demand, we say that demand is lower than supply.
What causes waiting time?
Variability leads to waiting time and inventory.
Variability: Where does it come from?
Inputs:
- Unpredicted volume swings
- Random arrivals
- Incoming quality
- Product mix
Tasks:
- Inherent variation
- Lack of SOP
- Quality (Scrap/rework)
Resources:
- Breakdowns /Maintenance
- Operator absence
- Set-up times
Routes:
- Variable routing
- Dedicated machines
There are countless reasons that leads to variability.
The arrival and service processes
What is arrival process:
- The flow of customers arriving to the system
What is service process:
- The flow of customers when they are being served.
What is average inter-arrival time:
- The average time between customers arrivals to a system (a)
What is average processing time:
- The average time a customer spends with a server (p) -> Service time
Measuring of variability: First source of variability -> The arrival process.
Coefficient of Variation = Standard Deviation / Mean
Measuring of variability: Second source of variability -> The service times
Coefficient of Variation = Standard Deviation of activity time / Average activity time
For exponential distribution, CV = 1., where CV = coefficient of variation.
What is the flow rate, flow time and assumptions made of a queuing model with a single server?
Flow rate, R = 1 / a
Capacity = 1 / p
Inventory in process: Ip
Inventory in queue: Iq
Flow time = Tq + p (Waiting time + Processing time)
Assumptions made:
- All incoming customers only join a single queue.
- When customers arrive, they will wait till the server is completed. They will not just leave because the queue is too long. This means that they will stick through and not leave halfway. (Patient customer)
- The average inter arrival time is greater than the average processing time. There is enough capacity to service the incoming demand.
Utilization
Utilization
= Flow rate / Capacity
= (1 /a) / (1 / p)
= p / a
- p being less than a is assumed in this model.
- This means that the time to process a customer is shorter than the time between customer arrivals - and - the capacity of the server, 1/p, is greater than the demand rate.