Patent application number | Description | Published |
20130345887 | INFRASTRUCTURE BASED COMPUTER CLUSTER MANAGEMENT - Various techniques of managing a computer cluster are disclosed herein. In one embodiment, a method for managing a computer cluster includes receiving a request for a computing operation, obtaining information of utility for the computer cluster, and determining an execution profile of the computing operation identified by the received request based at least in part on the obtained information. The information includes at least one of a configuration or condition of power, heating, cooling, ventilation that supports the computer cluster. The method also includes executing the computing operation in the computer cluster in accordance with the determined execution profile. | 12-26-2013 |
20140189706 | RELIABILITY-AWARE APPLICATION SCHEDULING - Reliability-aware scheduling of processing jobs on one or more processing entities is based on reliability scores assigned to processing entities and minimum acceptable reliability scores of processing jobs. The reliability scores of processing entities are based on independently derived statistical reliability models as applied to reliability data already available from modern computing hardware. Reliability scores of processing entities are continually updated based upon real-time reliability data, as well as prior reliability scores, which are weighted in accordance with the statistical reliability models being utilized. Individual processing jobs specify reliability requirements from which the minimum acceptable reliability score is determined. Such jobs are scheduled on processing entities whose reliability score is greater than or equal to the minimum acceptable reliability score for such jobs. Already scheduled jobs can be rescheduled on other processing entities if reliability scores change. Additionally, a hierarchical scheduling approach can be utilized. | 07-03-2014 |
20140380334 | HARDWARE MANAGEMENT COMMUNICATION PROTOCOL - A simplified hardware management communication protocol comprises defined request packets, which are utilized to transmit requests to lower layers of management functionality or to managed resources, and it also comprises defined response packets, which are utilized to transmit responses back to the source of the request. A request packet comprises an identification of a type of device, an identifier of that device, an address of the sending entity, a session identifier, a sequence number, a function identifier, and a payload that comprises encapsulated communications or data directed to the request target. A response packet can comprise an identification of the sender of the request, a session identifier, a sequence number, a completion code identifying whether and how the request was completed, and a payload. Managed asset type specific drivers translate into communications utilizing communicational protocols that are specific to the managed assets. | 12-25-2014 |
20150134723 | GEO-DISTRIBUTED DISASTER RECOVERY FOR INTERACTIVE CLOUD APPLICATIONS - Disaster recovery is provided for an application that is being hosted on a current data center, thus ensuring the availability of the application. An option for replicating session state data for the application is selected. This selection is made from a set of different session state data replication options each of which has different performance and resource cost trade-offs. The selected option determines how the session state data for the application is to be replicated. The selected option is implemented, where the implementation results in the session state data for the application being replicated outside of the current data center, thus ensuring that this data remains available in the event that the current data center goes offline. | 05-14-2015 |
20150192936 | Datacenter And Cooling Control Fault-Tolerance Using Compute Resources - Management of infrastructure devices is performed by computing devices that are associated with the processing being provided by the data center, such as chassis managers. A master is first selected through polling or consensus algorithms, and then subsequently the master is endowed with the authority to manage infrastructure devices and generate the control output to such infrastructure devices. Alternatively, no master need be elected and, instead, output to such infrastructure devices is generated by a computing device selected utilizing polling or consensus algorithms, and in accordance with a management decision made through polling or consensus algorithms. The interplay between the cooling apparatuses of individual server computing devices and the cooling apparatuses of the data center as a whole is also managed to increase the portion of the cooling of server computing devices provided by data center air movers. Control of data center air movers can be determined empirically or predictively. | 07-09-2015 |
20150227181 | BACKUP POWER MANAGEMENT FOR COMPUTING SYSTEMS - Various techniques for managing power backup for computing devices are disclosed herein. In one embodiment, a method includes receiving data representing a backup capacity of one or more backup power units and data representing a backup power profile of one or more processing units sharing the one or more backup power units. A portion of the backup capacity may then be assigned to each of the one or more processing units based at least in part on both the received data representing the backup capacity of the one or more backup power units and the received data representing the profile of the one or more processing units. | 08-13-2015 |
20150378425 | LOW LATENCY COMPUTER SYSTEM POWER REDUCTION - Technology for handling overcurrent conditions on electrical circuits that power multiple computing modules is disclosed. Aspects of the technology include a power system adapted to provide notifications of overcurrent conditions, and computing modules adapted to reduce an operating speed thereof in response to notification of an overcurrent condition. | 12-31-2015 |
20160118121 | Configurable Volatile Memory Data Save Triggers - A volatile memory data save subsystem may include a coupling to a shared power source such as a chassis or rack battery, or generator. A data save trigger controller sends a data save command toward coupled volatile memory device(s) such as NVDIMMs and PCIe devices under specified conditions: a programmable amount of time passes without AC power, a voltage level drops below normal but is still sufficient to power the volatile memory device during a data save operation, the trigger controller is notified of an operating system shutdown command, or the trigger controller is notified of an explicit data save command without a system shutdown command. NVDIMMs can avoid reliance on dedicated supercapacitors and dedicated batteries. An NVDIMM may perform an asynchronous DRAM reset in response to the data save command. Voltage step downs may be coordinated among power supplies. After data is saved, power cycles and the system reboots. | 04-28-2016 |