Patent application number | Description | Published
--- | --- | ---
20090165646 | EFFLUENT GAS RECOVERY PROCESS FOR SILICON PRODUCTION - Effluent gas from a polysilicon reactor is directed to a gas separation membrane; the permeate gas is recycled to the reactor, and the retentate is chilled in a cryogenic condenser using liquid cryogen. Liquid cryogen vaporized by the hot effluent gas may be stored, or used to seal and/or chill the reactor, or to blanket a Si feed to a SiHCl3 reactor. | 07-02-2009
20090165647 | EFFLUENT GAS RECOVERY PROCESS FOR SILICON PRODUCTION - Purified SiHCl3 is used as a sweep gas across a permeate side of a gas separation membrane receiving effluent gas from a polysilicon reactor. The combined sweep gas and permeate is recycled to the reactor. | 07-02-2009 |
20090166173 | Effluent gas recovery process for silicon production - Purified SiHCl3 is used as a sweep gas across a permeate side of a gas separation membrane receiving effluent gas from a polysilicon reactor. The combined sweep gas and permeate is recycled to the reactor. | 07-02-2009 |
20090320519 | Recovery of Hydrofluoroalkanes - A mixture of air and one or more halogenated alkanes is directed to a gas separation membrane where it is separated into an oxygen-, nitrogen-, and moisture-enriched, halogenated alkane-depleted permeate and a halogenated alkane-enriched, oxygen-, nitrogen-, and moisture-depleted retentate. The retentate is directed to a cryogenic condenser, where an amount of halogenated alkane is condensed. | 12-31-2009
20100077796 | Hybrid Membrane/Distillation Method and System for Removing Nitrogen from Methane - A hybrid gas separation membrane/cryogenic distillation method and system produces high purity gaseous methane from a gas mixture containing a majority of methane and a minority of nitrogen. | 04-01-2010 |
20100313750 | Method and System for Membrane-Based Gas Recovery - A fast gas is recovered from a feed gas containing a fast gas and at least one slow gas using a gas separation membrane. A controller may control a control valve associated with a partial recycle of a permeate gas from the membrane for combining with the feed gas. A controller may control a control valve associated with the backpressure of a residue gas from the membrane (a minimal control-loop sketch follows this table). | 12-16-2010
20110000257 | Effluent Gas Recovery System in Polysilicon and Silane Plants - Purified SiHCl3 | 01-06-2011
20130247761 | Method and System for Membrane-Based Gas Recovery - A fast gas is recovered from a feed gas containing a fast gas and at least one slow gas using a gas separation membrane. A controller may control a control valve associated with a partial recycle of a permeate gas from the membrane for combining with the feed gas. A controller may control a control valve associated with the backpressure of a residue gas from the membrane. | 09-26-2013 |
20130255483 | Method and System for Membrane-Based Gas Recovery - A fast gas is recovered from a feed gas containing a fast gas and at least one slow gas using a gas separation membrane. A controller may control a control valve associated with a partial recycle of a permeate gas from the membrane for combining with the feed gas. A controller may control a control valve associated with the backpressure of a residue gas from the membrane. | 10-03-2013 |
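The membrane-based recovery scheme of 20100313750 (and the later applications sharing its abstract) is built around two control loops: a valve on the partial permeate recycle and a valve on the residue backpressure. A minimal behavioral sketch follows, assuming simple proportional control and hypothetical setpoints; every name, gain, and unit below is an illustrative assumption, not taken from the patents.

```python
# Minimal sketch of the two control loops described in 20100313750: one
# valve on the partial permeate recycle, one setting residue backpressure.
# All names, setpoints, and gains are hypothetical.

class ProportionalValve:
    """Valve whose opening (0..1) is nudged proportionally toward a setpoint."""
    def __init__(self, opening=0.5, gain=0.05):
        self.opening = opening
        self.gain = gain

    def update(self, measured, setpoint):
        # Positive error -> open the valve further, clamped to [0, 1].
        error = setpoint - measured
        self.opening = min(1.0, max(0.0, self.opening + self.gain * error))
        return self.opening


def control_step(fast_gas_purity, residue_pressure_bar,
                 recycle_valve, backpressure_valve,
                 purity_setpoint=0.95, pressure_setpoint=8.0):
    """One controller cycle: trim the permeate recycle to hold product
    purity, and trim the residue backpressure to hold the membrane's
    driving force. Returns the two valve openings."""
    recycle = recycle_valve.update(fast_gas_purity, purity_setpoint)
    backpressure = backpressure_valve.update(residue_pressure_bar,
                                             pressure_setpoint)
    return recycle, backpressure


if __name__ == "__main__":
    recycle_valve = ProportionalValve()
    backpressure_valve = ProportionalValve(gain=0.02)
    # Pretend sensor readings; a real plant would read these from instruments.
    print(control_step(0.91, 7.4, recycle_valve, backpressure_valve))
```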
Patent application number | Description | Published
--- | --- | ---
20120198214 | N-WAY MEMORY BARRIER OPERATION COALESCING - One embodiment sets forth a technique for N-way memory barrier operation coalescing. When a first memory barrier is received for a first thread group, execution of subsequent memory operations for the first thread group is suspended until the first memory barrier is executed. Subsequent memory barriers for different thread groups may be coalesced with the first memory barrier to produce a coalesced memory barrier that represents memory barrier operations for multiple thread groups. When the coalesced memory barrier is being processed, execution of subsequent memory operations for the different thread groups is also suspended. However, memory operations for other thread groups that are not affected by the coalesced memory barrier may be executed (a behavioral sketch follows this table). | 08-02-2012
20130124838 | INSTRUCTION LEVEL EXECUTION PREEMPTION - One embodiment of the present invention sets forth a technique for instruction-level and compute thread array granularity execution preemption. Preempting at the instruction level does not require any draining of the processing pipeline: no new instructions are issued and the context state is unloaded from the processing pipeline. When preemption is performed at a compute thread array boundary, the amount of context state to be stored is reduced because execution units within the processing pipeline complete execution of in-flight instructions and become idle. If the amount of time needed to complete execution of the in-flight instructions exceeds a threshold, then the preemption may dynamically change to be performed at the instruction level instead of at compute thread array granularity. | 05-16-2013
20130132711 | COMPUTE THREAD ARRAY GRANULARITY EXECUTION PREEMPTION - One embodiment of the present invention sets forth a technique for instruction-level and compute thread array granularity execution preemption. Preempting at the instruction level does not require any draining of the processing pipeline: no new instructions are issued and the context state is unloaded from the processing pipeline. When preemption is performed at a compute thread array boundary, the amount of context state to be stored is reduced because execution units within the processing pipeline complete execution of in-flight instructions and become idle. If the amount of time needed to complete execution of the in-flight instructions exceeds a threshold, then the preemption may dynamically change to be performed at the instruction level instead of at compute thread array granularity. | 05-23-2013
20130166877 | SHAPED REGISTER FILE READS - One embodiment of the present invention sets forth a technique for performing a shaped access of a register file that includes a set of N registers, wherein N is greater than or equal to two. The technique involves, for at least one thread included in a group of threads, receiving a request to access a first amount of data from each register in the set of N registers, and configuring a crossbar to allow the at least one thread to access the first amount of data from each register in the set of N registers. | 06-27-2013 |
20130166882 | METHODS AND APPARATUS FOR SCHEDULING INSTRUCTIONS WITHOUT INSTRUCTION DECODE - Systems and methods for scheduling instructions without instruction decode. In one embodiment, a multi-core processor includes a scheduling unit in each core for scheduling instructions from two or more threads scheduled for execution on that particular core. As threads are scheduled for execution on the core, instructions from the threads are fetched into a buffer without being decoded. The scheduling unit includes a macro-scheduler unit for performing a priority sort of the two or more threads and a micro-scheduler arbiter for determining the highest order thread that is ready to execute. The macro-scheduler unit and the micro-scheduler arbiter use pre-decode data to implement the scheduling algorithm. The pre-decode data may be generated by decoding only a small portion of the instruction or received along with the instruction. Once the micro-scheduler arbiter has selected an instruction to dispatch to the execution unit, a decode unit fully decodes the instruction. | 06-27-2013 |
20130212364 | PRE-SCHEDULED REPLAYS OF DIVERGENT OPERATIONS - One embodiment of the present disclosure sets forth an optimized way to execute pre-scheduled replay operations for divergent operations in a parallel processing subsystem. Specifically, a streaming multiprocessor (SM) includes a multi-stage pipeline configured to insert pre-scheduled replay operations into a multi-stage pipeline. A pre-scheduled replay unit detects whether the operation associated with the current instruction is accessing a common resource. If the threads are accessing data which are distributed across multiple cache lines, then the pre-scheduled replay unit inserts pre-scheduled replay operations behind the current instruction. The multi-stage pipeline executes the instruction and the associated pre-scheduled replay operations sequentially. If additional threads remain unserviced after execution of the instruction and the pre-scheduled replay operations, then additional replay operations are inserted via the replay loop, until all threads are serviced. One advantage of the disclosed technique is that divergent operations requiring one or more replay operations execute with reduced latency. | 08-15-2013 |
20130232322 | UNIFORM LOAD PROCESSING FOR PARALLEL THREAD SUB-SETS - One embodiment of the present invention sets forth a technique for processing load instructions for parallel threads of a thread group when a sub-set of the parallel threads request the same memory address. The load/store unit determines if the memory addresses for each sub-set of parallel threads match based on one or more uniform patterns. When a match is achieved for at least one of the uniform patterns, the load/store unit transmits a read request to retrieve data for the sub-set of parallel threads. The number of read requests transmitted is reduced compared with performing a separate read request for each thread in the sub-set (a sketch follows this table). A variety of uniform patterns may be defined based on common access patterns present in program instructions. A variety of uniform patterns may also be defined based on interconnect constraints between the load/store unit and the memory when a full crossbar interconnect is not available. | 09-05-2013
20130268715 | DYNAMIC BANK MODE ADDRESSING FOR MEMORY ACCESS - One embodiment sets forth a technique for dynamically mapping addresses to banks of a multi-bank memory based on a bank mode. Application programs may be configured to read and write memory using different numbers of bits per bank, e.g., 32 bits per bank, 64 bits per bank, or 128 bits per bank. On each clock cycle, an access request may be received from one of the application programs, and the per-thread addresses of the access request are dynamically mapped based on the bank mode to produce a set of bank addresses. The bank addresses are then used to access the multi-bank memory. Allowing different bank mappings enables each application program to avoid bank conflicts when the memory is accessed, compared with using a single bank mapping for all accesses (a sketch follows this table). | 10-10-2013
20130311686 | MECHANISM FOR TRACKING AGE OF COMMON RESOURCE REQUESTS WITHIN A RESOURCE MANAGEMENT SUBSYSTEM - One embodiment of the present disclosure sets forth an effective way to maintain fairness and order in the scheduling of common resource access requests related to replay operations. Specifically, a streaming multiprocessor (SM) includes a total order queue (TOQ) configured to schedule the access requests over one or more execution cycles. Access requests are allowed to make forward progress when needed common resources have been allocated to the request. Where multiple access requests require the same common resource, priority is given to the older access request. Access requests may be placed in a sleep state pending availability of certain common resources. Deadlock may be avoided by allowing an older access request to steal resources from a younger resource request. One advantage of the disclosed technique is that older common resource access requests are not repeatedly blocked from making forward progress by newer access requests (a sketch follows this table). | 11-21-2013
20130311996 | MECHANISM FOR WAKING COMMON RESOURCE REQUESTS WITHIN A RESOURCE MANAGEMENT SUBSYSTEM - One embodiment of the present disclosure sets forth an effective way to maintain fairness and order in the scheduling of common resource access requests related to replay operations. Specifically, a streaming multiprocessor (SM) includes a total order queue (TOQ) configured to schedule the access requests over one or more execution cycles. Access requests are allowed to make forward progress when needed common resources have been allocated to the request. Where multiple access requests require the same common resource, priority is given to the older access request. Access requests may be placed in a sleep state pending availability of certain common resources. Deadlock may be avoided by allowing an older access request to steal resources from a younger resource request. One advantage of the disclosed technique is that older common resource access requests are not repeatedly blocked from making forward progress by newer access requests. | 11-21-2013 |
20130311999 | RESOURCE MANAGEMENT SUBSYSTEM THAT MAINTAINS FAIRNESS AND ORDER - One embodiment of the present disclosure sets forth an effective way to maintain fairness and order in the scheduling of common resource access requests related to replay operations. Specifically, a streaming multiprocessor (SM) includes a total order queue (TOQ) configured to schedule the access requests over one or more execution cycles. Access requests are allowed to make forward progress when needed common resources have been allocated to the request. Where multiple access requests require the same common resource, priority is given to the older access request. Access requests may be placed in a sleep state pending availability of certain common resources. Deadlock may be avoided by allowing an older access request to steal resources from a younger resource request. One advantage of the disclosed technique is that older common resource access requests are not repeatedly blocked from making forward progress by newer access requests. | 11-21-2013 |
20140165072 | TECHNIQUE FOR SAVING AND RESTORING THREAD GROUP OPERATING STATE - A streaming multiprocessor (SM) included within a parallel processing unit (PPU) is configured to suspend a thread group executing on the SM and to save the operating state of the suspended thread group. A load-store unit (LSU) within the SM re-maps local memory associated with the thread group to a location in global memory. Subsequently, the SM may re-launch the suspended thread group. The LSU may then perform local memory access operations on behalf of the re-launched thread group with the re-mapped local memory that resides in global memory. | 06-12-2014 |
20140168245 | TECHNIQUE FOR PERFORMING MEMORY ACCESS OPERATIONS VIA TEXTURE HARDWARE - A texture processing pipeline can be configured to service memory access requests that represent texture data access operations or generic data access operations. When the texture processing pipeline receives a memory access request that represents a texture data access operation, the texture processing pipeline may retrieve texture data based on texture coordinates. When the memory access request represents a generic data access operation, the texture pipeline extracts a virtual address from the memory access request and then retrieves data based on the virtual address. The texture processing pipeline is also configured to cache generic data retrieved on behalf of a group of threads and to then invalidate that generic data when the group of threads exits. | 06-19-2014 |
20140173193 | TECHNIQUE FOR ACCESSING CONTENT-ADDRESSABLE MEMORY - A tag unit configured to manage a cache unit includes a coalescer that implements a set hashing function. The set hashing function maps a virtual address to a particular content-addressable memory unit (CAM). The coalescer implements the set hashing function by splitting the virtual address into upper, middle, and lower portions. The upper portion is further divided into even-indexed bits and odd-indexed bits. The even-indexed bits are reduced to a single bit using an XOR tree, and the odd-indexed bits are reduced in like fashion. Those single bits are combined with the middle portion of the virtual address to provide a CAM number that identifies a particular CAM. The identified CAM is queried to determine the presence of a tag portion of the virtual address, indicating a cache hit or cache miss (a sketch follows this table). | 06-19-2014
20140173258 | TECHNIQUE FOR PERFORMING MEMORY ACCESS OPERATIONS VIA TEXTURE HARDWARE - A texture processing pipeline can be configured to service memory access requests that represent texture data access operations or generic data access operations. When the texture processing pipeline receives a memory access request that represents a texture data access operation, the texture processing pipeline may retrieve texture data based on texture coordinates. When the memory access request represents a generic data access operation, the texture pipeline extracts a virtual address from the memory access request and then retrieves data based on the virtual address. The texture processing pipeline is also configured to cache generic data retrieved on behalf of a group of threads and to then invalidate that generic data when the group of threads exits. | 06-19-2014 |
20140189260 | APPROACH FOR CONTEXT SWITCHING OF LOCK-BIT PROTECTED MEMORY - A streaming multiprocessor in a parallel processing subsystem processes atomic operations for multiple threads in a multi-threaded architecture. The streaming multiprocessor receives a request from a thread in a thread group to acquire access to a memory location in a lock-protected shared memory, and determines whether an address lock in a plurality of address locks is asserted, where the address lock is associated with the memory location. If the address lock is asserted, then the streaming multiprocessor refuses the request. Otherwise, the streaming multiprocessor asserts the address lock, asserts a thread group lock in a plurality of thread group locks, where the thread group lock is associated with the thread group, and grants the request. One advantage of the disclosed techniques is that acquired locks are released when a thread is preempted. As a result, a preempted thread that has previously acquired a lock does not retain the lock indefinitely (a sketch follows this table). | 07-03-2014
20140189329 | COOPERATIVE THREAD ARRAY GRANULARITY CONTEXT SWITCH DURING TRAP HANDLING - Techniques are provided for handling a trap encountered in a thread that is part of a thread array that is being executed in a plurality of execution units. In these techniques, a data structure with an identifier associated with the thread is updated to indicate that the trap occurred during the execution of the thread array. Also in these techniques, the execution units execute a trap handling routine that includes a context switch. The execution units perform this context switch for at least one of the execution units as part of the trap handling routine while allowing the remaining execution units to exit the trap handling routine before the context switch. One advantage of the disclosed techniques is that the trap handling routine operates efficiently in parallel processors. | 07-03-2014 |
20140189711 | COOPERATIVE THREAD ARRAY GRANULARITY CONTEXT SWITCH DURING TRAP HANDLING - Techniques are provided for restoring thread groups in a cooperative thread array (CTA) within a processing core. Each thread group in the CTA is launched to execute a context restore routine. Each thread group executes the context restore routine to restore from memory a first portion of the context associated with the thread group, and determines whether the thread group completed an assigned function prior to executing the context restore routine. If the thread group completed an assigned function prior to executing the context restore routine, then the thread group exits the context restore routine. If the thread group did not complete the assigned function prior to executing the context restore routine, then the thread group executes one or more operations associated with a trap handler routine. One advantage of the disclosed techniques is that the trap handling routine operates efficiently in parallel processors. | 07-03-2014
20140281679 | SELECTIVE FAULT STALLING FOR A GPU MEMORY PIPELINE IN A UNIFIED VIRTUAL MEMORY SYSTEM - One embodiment of the present invention is a parallel processing unit (PPU) that includes one or more streaming multiprocessors (SMs) and implements a selective fault-stalling pipeline. Upon detecting a memory access fault associated with an operation executing on a particular SM, a replay unit in the selective fault-stalling pipeline considers the operation a faulting operation. Subsequently, instead of notifying the SM of the memory access fault, the replay unit recirculates the operation—reinserting the operation into the selective fault-stalling pipeline. Recirculating faulting operations in such a fashion enables the SM to execute other operations while the replay unit stalls the faulting operation until the associated access fault is resolved. Advantageously, the overall performance of the PPU is improved compared to conventional PPUs that, upon detecting a memory access fault, cancel the associated operation and subsequent operations. | 09-18-2014
20140372703 | SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR WARMING A CACHE FOR A TASK LAUNCH - A system, method, and computer program product for warming a cache for a task launch is described. The method includes the steps of receiving a task data structure that defines a processing task, extracting information stored in a cache warming field of the task data structure, and, prior to executing the processing task, generating a cache warming instruction that is configured to load one or more entries of a cache storage with data fetched from a memory. | 12-18-2014 |
20150103087 | SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR DISCARDING PIXEL SAMPLES - A system, method, and computer program product are provided for discarding pixel samples. The method includes the steps of completing shading operations for a pixel set including one or more pixels to generate per-sample shaded attributes according to a shader program executed by a processing pipeline. Discard information for the pixel set is evaluated and one or more per-sample shaded attributes for at least one pixel in the pixel set are discarded based on the evaluated discard information. | 04-16-2015 |
20150113254 | EFFICIENCY THROUGH A DISTRIBUTED INSTRUCTION SET ARCHITECTURE - A subsystem is configured to support a distributed instruction set architecture with primary and secondary execution pipelines. The primary execution pipeline supports the execution of a subset of instructions in the distributed instruction set architecture that are issued frequently. The secondary execution pipeline supports the execution of another subset of instructions in the distributed instruction set architecture that are issued less frequently. Both execution pipelines also support the execution of FFMA instructions as well as a common subset of instructions in the distributed instruction set architecture. When dispatching a requested instruction, an instruction scheduling unit is configured to select between the two execution pipelines based on various criteria. Those criteria may include the power efficiency with which the instruction can be executed and the availability of execution units to support execution of the instruction. | 04-23-2015
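Several of the abstracts above describe mechanisms concrete enough that a small behavioral model clarifies them. First, the N-way memory barrier coalescing of 20120198214: barriers arriving from different thread groups merge into one coalesced barrier, and only the covered groups have their memory operations suspended. This is a minimal Python sketch of that behavior, not the patented hardware; the class and method names are invented for illustration.

```python
# Sketch of N-way memory barrier coalescing from 20120198214: barriers from
# different thread groups are merged into one coalesced barrier; memory
# operations from every covered group stay suspended until the coalesced
# barrier completes, while unaffected groups keep executing.

class CoalescedBarrier:
    def __init__(self):
        self.groups = set()   # thread groups covered by this barrier

    def add(self, group):
        self.groups.add(group)

class BarrierUnit:
    def __init__(self):
        self.pending = None

    def barrier(self, group):
        # A new barrier either starts a coalesced barrier or joins one.
        if self.pending is None:
            self.pending = CoalescedBarrier()
        self.pending.add(group)

    def may_issue_memory_op(self, group):
        # Groups under the coalesced barrier are suspended; others proceed.
        return self.pending is None or group not in self.pending.groups

    def complete(self):
        # Executing the coalesced barrier releases every covered group.
        self.pending = None

unit = BarrierUnit()
unit.barrier("group0"); unit.barrier("group1")   # coalesced 2-way
print(unit.may_issue_memory_op("group0"))        # False: suspended
print(unit.may_issue_memory_op("group2"))        # True: unaffected
unit.complete()
print(unit.may_issue_memory_op("group0"))        # True again
```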
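Next, the uniform load processing of 20130232322. The simplest uniform pattern is "every thread in the sub-set requests the same address," in which case one broadcast read replaces per-thread reads. The sub-set width and the pattern chosen here are illustrative assumptions; the patent defines a variety of patterns.

```python
# Sketch of the uniform-pattern check from 20130232322: if every thread in
# a sub-set requests the same address (one possible uniform pattern), the
# load/store unit issues a single read instead of one per thread.

SUBSET_SIZE = 4  # assumed sub-set width

def read_requests(thread_addresses):
    """Return the read requests actually issued: one per uniform sub-set,
    or one per thread when a sub-set is not uniform."""
    requests = []
    for i in range(0, len(thread_addresses), SUBSET_SIZE):
        subset = thread_addresses[i:i + SUBSET_SIZE]
        if len(set(subset)) == 1:          # all threads match: uniform
            requests.append(subset[0])     # single broadcast read
        else:
            requests.extend(subset)        # fall back to per-thread reads
    return requests

# 8 threads, both sub-sets uniform: 2 reads instead of 8.
print(read_requests([0x100] * 4 + [0x200] * 4))
```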
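The dynamic bank mode addressing of 20130268715 can be modeled the same way: the identical byte address lands on a different (bank, word) pair depending on whether the mode is 32, 64, or 128 bits per bank. The bank count and address layout below are assumptions; the point is that an access pattern conflicting in one mode can be conflict-free in another.

```python
# Sketch of the dynamic bank mapping from 20130268715. NUM_BANKS and the
# address layout are illustrative assumptions.

NUM_BANKS = 32

def map_address(byte_address, bank_mode_bits):
    """Map a per-thread byte address to (bank, word-within-bank) for a
    bank mode of 32, 64, or 128 bits per bank."""
    assert bank_mode_bits in (32, 64, 128)
    bytes_per_bank_word = bank_mode_bits // 8
    word = byte_address // bytes_per_bank_word
    return word % NUM_BANKS, word // NUM_BANKS

def has_bank_conflict(thread_byte_addresses, bank_mode_bits):
    """True if two threads hit the same bank at different words: in
    hardware those accesses would be serialized."""
    seen = {}
    for addr in thread_byte_addresses:
        bank, row = map_address(addr, bank_mode_bits)
        if bank in seen and seen[bank] != row:
            return True
        seen[bank] = row
    return False

addrs = [i * 8 for i in range(32)]      # one 8-byte element per thread
print(has_bank_conflict(addrs, 32))     # True: 2-way conflicts in 32-bit mode
print(has_bank_conflict(addrs, 64))     # False: conflict-free in 64-bit mode
```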
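The total order queue of 20130311686/20130311996/20130311999 grants contended common resources oldest-first, with younger contenders sleeping in the queue until the resource frees up. This sketch models only that age-priority core; the resource model and names are assumptions, and resource stealing for deadlock avoidance is omitted.

```python
# Sketch of oldest-first TOQ scheduling from 20130311686 and its siblings.
import heapq

class TOQ:
    """Oldest-first scheduling of common-resource access requests."""
    def __init__(self):
        self._heap = []    # (age, name, resource); smaller age = older
        self._age = 0

    def enqueue(self, name, resource):
        heapq.heappush(self._heap, (self._age, name, resource))
        self._age += 1

    def schedule(self, free_resources):
        """Grant every request whose resource is free, oldest first; a
        resource taken by an older request is busy for younger ones,
        which go back to sleep in the queue."""
        granted, asleep = [], []
        while self._heap:
            entry = heapq.heappop(self._heap)
            age, name, resource = entry
            if resource in free_resources:
                free_resources.remove(resource)   # older request takes it
                granted.append(name)
            else:
                asleep.append(entry)              # sleep until woken
        for entry in asleep:
            heapq.heappush(self._heap, entry)
        return granted

toq = TOQ()
toq.enqueue("old_req", "cache_line_A")
toq.enqueue("young_req", "cache_line_A")   # contends with the older request
print(toq.schedule({"cache_line_A"}))      # ['old_req']: age wins
print(toq.schedule({"cache_line_A"}))      # ['young_req'] on the next cycle
```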
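The set hashing function of 20140173193 is spelled out almost completely in its abstract: split the virtual address into upper, middle, and lower portions; XOR-reduce the even- and odd-indexed bits of the upper portion to one parity bit each; combine those two bits with the middle portion to select a CAM. Only the field widths below are assumptions.

```python
# Sketch of the set-hashing function from 20140173193. The field widths
# are illustrative, not taken from the patent.

LOWER_BITS = 7    # e.g. byte offset within a cache line (assumed)
MIDDLE_BITS = 2   # middle portion contributes 2 bits of the CAM number
UPPER_SHIFT = LOWER_BITS + MIDDLE_BITS

def xor_reduce(bits):
    """Reduce an iterable of bits to one parity bit, as an XOR tree would."""
    result = 0
    for b in bits:
        result ^= b
    return result

def cam_number(virtual_address, upper_width=32):
    middle = (virtual_address >> LOWER_BITS) & ((1 << MIDDLE_BITS) - 1)
    upper = (virtual_address >> UPPER_SHIFT) & ((1 << upper_width) - 1)
    upper_bits = [(upper >> i) & 1 for i in range(upper_width)]
    even = xor_reduce(upper_bits[0::2])   # even-indexed bits of the upper part
    odd = xor_reduce(upper_bits[1::2])    # odd-indexed bits of the upper part
    # Two parity bits concatenated with the middle portion pick 1 of 16 CAMs.
    return (even << (MIDDLE_BITS + 1)) | (odd << MIDDLE_BITS) | middle

print(cam_number(0x7F3A9C40))
```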
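Finally, the lock-bit scheme of 20140189260: a request for a lock-protected location is refused if that location's address lock is asserted; otherwise the address lock and the requesting thread group's lock bit are asserted, and the group's locks are released on preemption so none are held indefinitely. A behavioral model only; all names are invented.

```python
# Sketch of the lock-bit scheme from 20140189260. clear_group() models the
# patent's key point: a preempted thread group's locks are released.

class LockBits:
    def __init__(self):
        self.address_locks = set()    # asserted address locks
        self.group_locks = {}         # thread group -> held addresses

    def acquire(self, group, address):
        if address in self.address_locks:
            return False                      # refused: already locked
        self.address_locks.add(address)       # assert the address lock
        self.group_locks.setdefault(group, set()).add(address)
        return True                           # granted

    def clear_group(self, group):
        """On preemption, release every lock the group holds so it cannot
        retain a lock indefinitely."""
        for address in self.group_locks.pop(group, set()):
            self.address_locks.discard(address)

locks = LockBits()
print(locks.acquire("tg0", 0x40))   # True: granted
print(locks.acquire("tg1", 0x40))   # False: refused, tg0 holds the lock
locks.clear_group("tg0")            # tg0 preempted: its locks are released
print(locks.acquire("tg1", 0x40))   # True now
```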