2007
DOI: 10.1109/tpds.2007.1093
|View full text |Cite
|
Sign up to set email alerts
|

Software-Based Failure Detection and Recovery in Programmable Network Interfaces

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…Periodic device polling for monitoring information about the liveness of hardware or software has been used as fault detection and identification in distributed systems (Bheevgade and Patrikar 2008; Zhou et al 2007; Bhagyashree et al 2010). A technique used at software level for fault identification is known as “heartbeat” where a liveness message is produced by the device mentioning about its correct functioning and working (Ammendola et al 2015), though it has a slight disadvantage of creating extra network traffic.…”
Section: Challenges In Grid Dependabilitymentioning
confidence: 99%
“…Periodic device polling for monitoring information about the liveness of hardware or software has been used as fault detection and identification in distributed systems (Bheevgade and Patrikar 2008; Zhou et al 2007; Bhagyashree et al 2010). A technique used at software level for fault identification is known as “heartbeat” where a liveness message is produced by the device mentioning about its correct functioning and working (Ammendola et al 2015), though it has a slight disadvantage of creating extra network traffic.…”
Section: Challenges In Grid Dependabilitymentioning
confidence: 99%
“…the use of watchdog components, either in hardware or software, is well documented in fault tolerance; this has been applied pervasively to detect faults on distributed systems [6][7][8]. More widely used at a software level is the similar concept of the heartbeats mechanism [9], where devices to be monitored emit a sort of ⟨I'm alive⟩ message [10]; obvious drawback is the generation of network traffic overhead.…”
Section: Related Workmentioning
confidence: 99%
“…In Zhou et al [36], whole packets are copied from the network router to an attached host processor memory. If a fault is detected, the router is restarted with the saved packets.…”
Section: Related Workmentioning
confidence: 99%